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(2) INFORMATION FOR SEQ ID NO:l: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1457 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 115.. 1326 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 

TGTTGAGTCA ACAGCTGTAG ATACACCAAT TGTTGCCGAT TTCTTTCTTT TCGACTGTCG 60 

GCTTCTCGCG AAACTGTGAT TGTGAAAATT GTACAAATAG AGGCAAATTT AACC ATG 117 

Met 
1 

GCG CAC ATG TCC CAC ATG CTC CAG CAG CCT TCG GGG TCG ACG CCC TCC 165 
Ala His Met Ser His Met Leu Gin Gin Pro Ser Gly Ser Thr Pro Ser 
5 10 15 

AAC GTG GGC TCC AGC TCA TCG CGC ACG ATG TCC CTG ATG GAG AAA CAA 213 
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Asn val Gly Ser ser ser Ser Arg Thr Met Ser Leu Met Glu Lys Gin 
20 25 30 

AAG TAC ATC GAG GAC TAC GAC TTT CCC TAC TGC GAC GAG AGC AAC AAA 261 

Lys Tyr lie Glu Asp Tyr Asp Phe Pro Tyr Cys Asp Glu Ser Asn Lys 
35 40 45 

TAC GAA AAG GTG GCG AAA ATT GGC CAA GGC ACC TTC GGA GAG GTT TTT 309 

Tyr Glu Lys val Ala Lys lie Gly Gin Gly Thr Phe Gly Glu Val Phe 
50 55 60 65 

AAG GCT CGC GAG AAA AAG GGC AAC AAG AAG TTT GTG GCC ATG AAG AAG 357 
Lys Ala Arg Glu Lys Lys Gly Asn Lys Lys Phe val Ala Met Lys Lys 
70 75 80 

GTG CTG ATG GAC AAC GAA AAG GAG GGC TTT CCC ATC ACG GCT CTG CGA 405 
Val Leu Met Asp Asn Glu Lys Glu Gly Phe Pro lie Thr Ala Leu Arg 
85 90 95 

GAG ATC CGC ATC CTG CAG CTG CTA AAG CAC GAG AAC GTG GTG AAT CTG 453 
Glu lie Arg lie Leu Gin Leu Leu Lys His Glu Asn Val Val Asn Leu 
100 105 110 

ATC GAG ATC TGC CGC ACC AAG GCC ACC GCC ACG AAT GGT TAC AGA TCC 501 
lie Glu lie Cys Arg Thr Lys Ala Thr Ala Thr Asn Gly Tyr Arg Ser 
115 120 125 

ACC TTC TAT TTG GTC TTT GAT TTC TGC GAA CAC GAT TTG GCA GGT CTT 549 

Thr Phe Tyr Leu val Phe Asp Phe Cys Glu His Asp Leu Ala Gly Leu 
130 135 140 145 

CTG TCC AAC ATG AAC GTC AAG TTC AGT CTG GGC GAG ATT AAG AAG GTT 597 

Leu Ser Asn Met Asn val Lys Phe Ser Leu Gly Glu lie Lys Lys val 
150 155 160 

ATG CAG CAG CTT TTA AAC GGT TTG TAT TAC ATC CAC AGC AAC AAG ATC 645 

Met Gin Gin Leu Leu Asn Gly Leu Tyr Tyr lie His Ser Asn Lys lie 
165 170 175 

CTG CAC CGA GAC ATG AAA GCT GCC AAC GTG CTG ATT ACC AAG CAT GGC 693 

Leu His Arg Asp Met Lys Ala Ala Asn val Leu lie Thr Lys His Gly 
180 185 190 

ATC TTA AAG CTG GCT GAC TTT GGC TTG GCC CGT GCT TTT AGC ATT CCA 741 

lie Leu Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Ser lie Pro 
195 200 205 

AAG AAC GAG AGT AAG AAT CGC TAT ACC AAT CGC GTA GTA ACC TTG TGG 789 

Lys Asn Glu Ser Lys Asn Arg Tyr Thr Asn Arg val val Thr Leu Trp 
210 215 220 225 

TAC CGG CCG CCT GAG CTG CTA CTT GGT GAC CGC AAC TAT GGT CCA CCC 837 
Tyr Arg Pro Pro Glu Leu Leu Leu Gly Asp Arg Asn Tyr Gly Pro Pro 
230 235 240 

GTG GAC ATG TGG GGA GCC GGC TGC ATA ATG GCC GAG ATG TGG ACA CGC 885 
val Asp Met Trp Gly Ala Gly Cys lie Met Ala Glu Met Trp Thr Arg 
245 250 255 

TCG CCC ATC ATG CAA GGC AAT ACG GAG CAG CAG CAG TTA ACC TTT ATT 933 
Ser Pro He Met Gin Gly Asn Thr Glu Gin Gin Gin Leu Thr Phe lie 
260 265 270 
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TCG CAG CTA TGC GGC TCC TTT ACG CCG GAC GTG TGG CCG GGA GTG GAG 981 

Ser Gin Leu Cys Gly Ser Phe Thr Pro Asp val Trp Pro Gly val Glu 
275 280 285 

GAG CTG GAG CTG TAC AAA TCC ATC GAG CTG CCA AAG AAC CAG AAG CGT 1029 

Glu Leu Glu Leu Tyr Lys Ser lie Glu Leu Pro Lys Asn Gin Lys Arg 
290 295 300 305 

CGA GTC AAG GAG CGC CTG CGT CCG TAT GTC AAG GAT CAA ACC GGC TGT 1077 
Arg val Lys Glu Arg Leu Arg Pro Tyr val Lys Asp Gin Thr Gly Cys 
310 315 320 

GAT CTA TTG GAC AAA TTG CTG ACC CTT GAT CCC AAG AAA CGC ATC GAT 1125 

Asp Leu Leu Asp Lys Leu Leu Thr Leu Asp Pro Lys Lys Arg lie Asp 
325 330 335 

GCG GAC ACA GCT CTG AAT CAC GAC TTC TTC TGG ACG GAT CCC ATG CCC 1173 
Ala Asp Thr Ala Leu Asn His Asp Phe Phe Trp Thr Asp Pro Met Pro 
340 345 350 

AGC GAC TTG AGC AAG ATG CTG TCC CAG CAC CTG CAG AGC ATG TTC GAG 1221 

Ser Asp Leu Ser Lys Met Leu Ser Gin His Leu Gin ser Met Phe Glu 
355 360 365 

TAC CTG GCG CAG CCA CGC CGC AGC AAC CAG ATG CGC AAC TAT CAC CAG 1269 
Tyr Leu Ala Gin Pro Arg Arg Ser Asn Gin Met Arg Asn Tyr His Gin 
370 375 380 385 

CAA CTG ACC ACC ATG AAC CAG AAG CCC CAG GAC AAC AGT ATG ATT GAC 1317 

Gin Leu Thr Thr Met Asn Gin Lys Pro Gin Asp Asn Ser Met lie Asp 
390 395 400 

CGG GTT TGG TAGACTGCCA GAGGTGTACG CACCCGACTA ATAGTTTCTC 1366 

Arg val Trp 

ACCTTCAACT AGCGTTAGGT TATTAGGTTA GTGTACAATA AAAATATTGG CATTTGCATT 1426 
AGCGCTTGCT CCAAATATAA AAAAAAAAAA A 1457 

(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 404 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE type: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Ala His Met Ser His Met Leu Gin Gin Pro Ser Gly Ser Thr Pro 
15 10 15 

Ser Asn val Gly Ser ser Ser Ser Arg Thr Met Ser Leu Met Glu Lys 
20 25 30 

Gin Lys Tyr lie Glu Asp Tyr Asp Phe Pro Tyr Cys Asp Glu Ser Asn 
35 40 45 

Lys Tyr Glu Lys val Ala Lys lie Gly Gin Gly Thr Phe Gly Glu val 
50 55 60 
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Phe Lys Ala Arg Glu Lys Lys Gly Asn Lys Lys Phe val Ala Met Lys 
65 70 75 80 

Lys val Leu Met Asp Asn Glu Lys Glu Gly Phe Pro lie Thr Ala Leu 
85 90 95 

Arg Glu lie Arg lie Leu Gin Leu Leu Lys His Glu Asn val val Asn 
100 105 110 

Leu lie Glu lie Cys Arg Thr Lys Ala Thr Ala Thr Asn Gly Tyr Arg 
115 120 125 

Ser Thr Phe Tyr Leu val Phe Asp Phe Cys Glu His Asp Leu Ala Gly 
130 135 140 

Leu Leu Ser Asn Met Asn val Lys Phe Ser Leu Gly Glu lie Lys Lys 
145 150 155 160 

val Met Gin Gin Leu Leu Asn Gly Leu Tyr Tyr lie His Ser Asn Lys 
165 170 175 

lie Leu His Arg Asp Met Lys Ala Ala Asn val Leu lie Thr Lys His 
180 185 190 

Gly lie Leu Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Ser lie 
195 200 205 

Pro Lys Asn Glu Ser Lys Asn Arg Tyr Thr Asn Arg val val Thr Leu 
210 215 220 

Trp Tyr Arg Pro Pro Glu Leu Leu Leu Gly Asp Arg Asn Tyr Gly Pro 
225 230 235 240 

Pro val Asp Met Trp Gly Ala Gly Cys lie Met Ala Glu Met Trp Thr 
245 250 255 

Arg Ser Pro lie Met Gin Gly Asn Thr Glu Gin Gin Gin Leu Thr Phe 
260 265 270 

lie Ser Gin Leu Cys Gly Ser Phe Thr Pro Asp val Trp Pro Gly val 
275 280 285 

Glu Glu Leu Glu Leu Tyr Lys Ser lie Glu Leu Pro Lys Asn Gin Lys 
290 295 300 

Arg Arg Val Lys Glu Arg Leu Arg Pro Tyr Val Lys Asp Gin Thr Gly 
305 310 315 320 

Cys Asp Leu Leu Asp Lys Leu Leu Thr Leu Asp Pro Lys Lys Arg lie 
325 330 335 

Asp Ala Asp Thr Ala Leu Asn His Asp Phe Phe Trp Thr Asp Pro Met 
340 345 350 

Pro Ser Asp Leu Ser Lys Met Leu Ser Gin His Leu Gin Ser Met Phe 
355 360 365 

Glu Tyr Leu Ala Gin Pro Arg Arg Ser Asn Gin Met Arg Asn Tyr His 
370 375 380 

Gin Gin Leu Thr Thr Met Asn Gin Lys Pro Gin Asp Asn ser Met lie 
385 390 395 400 

Page 4 



Asp Arg val Trp 



4200.000200 



(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



CAGCCCTGCC 


GACGGCCATA 


CTTGAAAATA 


CA 1 1 1 1 II 1 C 


TGCAAAGTTT 


GTCATTGTCA 


60 


CTGTGTGAAT 


GGAATCTGTG 


ATGTGTTGTG 


GAATTAAAAA 


CGTCAAGTAA 


ACAACCCGTA 


120 


ATGGTTAAAG 


TGCACGGCGA 


AAGCAGTGCG 


AATAACTATG 


AATTGATACA 


AAAGTTGCAT 


180 


AACACGTCGC 


CTGGTGTCGC 


GGTTAGTGTG 


1 1 1 1 1 CGTCT 


CG 1 1 1 CG 1 1 1 


CCGCCGCAGT 


240 


CGCAGTTTCC 


AAAAAACCTC 


ACCACACCAT 


ACCATCTCCA 


CCACGCACAC 


ACACACACAA 


300 


ACAAACACGC 


AGAGACGCGG 


CGGCGGAAAA 


AGTGTGCGGA 


CCGCGGA 1 1 1 


AACCCCTCGT 


360 


TCCAAACCCA 


AATTGGAGTC 


TCCCAAAAAC 


AGCGAAATAT 


CGAGTGTGGC 


TTAGCCGATG 


420 


TGCCGTGCGA 


TCCCCACTGC 


CCCTTCCGTA 


CCGCTGCCAC 


CCCCGCCACA 


GCAGCAACGC 


480 


ACACGGATAC 


GGACACAGAC 


ACCAATACCA 


GCGCACTCAA 


GCACGGCCGA 


CAAAGAAAGA 


540 


GCGCTCTCCC 


TTCC ICING 


TACAGTTAGT 


TCCTACAGCT 


GAATCAGCCA 


AAAGAAATTA 


600 


CTAGGTCCAT 


TCCGAGGCGC 


AG 1 1 1 GCATG 


TGAAACGGAG 


GTCCCCGCAT 


AACCACGCGG 


660 


AACCCGAAAT 


TCCAGATCCC 


CATCTCCGCT 


GCACGGATAA 


AGGAAACATA 


CAACCATGAG 


720 


TCTCCTAGCC 


ACGCCAATGC 


CCCAGGCGGC 


CACCGCCTCA 


TCTTCTTCAT 


CCGCCTCCGC 


780 


GGCCGCCTCG 

VJ VJ \#\«\J vVp< 1 VJ 


GCCAGCGGGA 

VI V— V— VJ V. VJ VJ 


TTCCAATCAC 


CGCCAACAAC 


AACCTGCCTT 


TCGAGAAGGA 

1 V* VJ/l VJ/v/% VJ VJ^\ 


840 


CAAGATCTGG 


TACTTCAGCA 


ACGATCAGCT 


GGCCAATTTG 


CCAAGCAGAA 


GATGCGGCAT 


900 


CAAGGGCGAC 


GATGAGCTGC 


AGTACCGCCA 


GATGACCGCC 


TATCTGATAC 


AGGAAATGGG 


960 


TCAGCGTCTG 


CAGGTGTCCC 


AACTGTGCAT 


CAACACGGCC 


ATTGTGTACA 


TGCATCGGTT 


1020 


CTACGCC 1 1 1 


CACTCCTTCA 


CCCACI 1 ICA 


TCGCAACTCC 


ATGGCGTCGG 


CGAGCCTCTT 


1080 


CTTGGCCGCC 


AAGGTAGAAG 


AGCAACCGCG 


GAAGCTGGAG 


CATGTTATTC 


GGGCCGCCAA 


1140 


CAAGTGCCTG 


CCGCCGACCA 


CCGAGCAGAA 


TTACGCCGAA 


CTCGCCCAGG 


AGCTTGTGTT 


1200 


CAACGAGAAC 


GTGCTCCTGC 


AGACGCTGGG 


CTTCGATGTG 


GCCATCGATC 


ATCCGCACAC 


1260 


GCATGTGGTG 


CGCACCTGCC 


AGCTGGTCAA 


AGCATGCAAG 


GATCTGGCGC 


AGACATCGTA 


1320 


CTTCTTGGCC 


TCGAACAGCC 


TGCATCTGAC 


CTCGATGTGC 


CTCCAATATC 


GCCCCACGGT 


1380 


CGTAGCCTGT 


TTCTGCATTT 


ACCTAGCCTG 


CAAGTGGTCC 


CGATGGGAGA 


TCCCCCAGTC 


1440 
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GACCGAGGGC 


AAGCACTGGT 


TCTACTATGT 


GGACAAGACG 


GTCTCGCTGG 


ATTTGCTAAA 


1500 


GCAGCTGACA 


GATGAGTTCA 


TCGCTATCTA 


TGAGAAGAGC 


CCGGCCCGTC 


TGAAGTCTAA 


1560 


GCTTAACTCG 


ATCAAGGCGA 


TCGCCCAGGG 


AGCCAGCAAT 


CGGACAGCTA 


ACAGCAAGGA 


1620 


CAAACCAAAG 


GAGGACTGGA 


AGATCACCGA 


GATGATGAAG 


GGCTACCACT 


CAAACATCAC 


1680 


GACACCACCA 


GAGCTGTTAA 


ACGGCAACGA 


CAGCCGGGAT 


CGGGACCGAG 


ATCGTGAACG 


1740 


GGAGAGAGAG 


CGGGAACGGG 


ATCCGTCGTC 


ACTACTGCCG 


CCACCGGCTA 


TGGTGCCGCA 


1800 


GCAAAGACGA 


CAGGATGGTG 


GACATCAGCG 


CTCGTCCTCA 


GTGAGCGGAG 


TGCCAGGCAG 


1860 


CAGCTCTTCG 


TCGTCTTCCT 


CCAGTCACAA 


GATGCCAAAT 


TACCCTGGTG 


GCATGCCGCC 


1920 


CGAAGCTCAT 


CCGGATCACA 


AGTCAAAGCA 


GCCGGGCTAT 


AACAATCGAA 


TGCCCTCAAG 


1980 


TCACCAGCGT 


AGTAGTAGCA 


GTGGACTCGG 


TTCCTCGGGA 


AGTGGCAGCC 


AGCACAGCAG 


2040 


CTCATCCTCG 


TCGTCTTCAA 


GCCAGCAGCC 


TGGCCGACCG 


TCTATGCCCG 


TGGACTATCA 


2100 


CAAATCCTCT 


CGCGGCATGC 


CGCCGGTAGG 


CGTGGGCATG 


CCACCTCACG 


GCAGCCACAA 


2160 


GATGACTTCG 


GGCTCCAAGC 


CTCAACAGCC 


GCAGCAGCAG 


CCGGTCCCAC 


ATCCATCCGC 


2220 


CTCTAATTCC 


TCTGCATCGG 


GCATGTCCTC 


CAAGGATAAA 


TCCCAGAGCA 


ACAAAATGTA 


2280 


TCCGAACGCA 


CCGCCGCCAT 


ACAGTAATAG 


TGCCCCTCAA 


AACCCGCTGA 


TGTCGCGTGG 


2340 


TGGATATCCA 


GGCGCTAGCA 


ATGGATCCCA 


GCCCCCGCCT 


CCCGCCGGAT 


ACGGCGGCCA 


2400 


TCGCAGCAAA 


TCCGGCTCCA 


CCGTCCATGG 


CATGCCGCAT 


TTCGAGCAGC 


AATTGCCCTA 


2460 


TTCCCAGAGC 


CAGAGCTACG 


GCCACATGCA 


GCAGCAGCCA 


GTGCCTCAGT 


CTCAGCAGCA 


2520 


ACAGATGCCT 


CCGGAGGCAT 


CCCAGCACTC 


GTTGCAGTCC 


AAGAACTCGC 


TCTTCAGTCC 


2580 


AGAGTGGCCA 


GACATTAAAA 


AGGAGCCCAT 


GTCGCAGTCG 


CAACCACAGC 


1 1 1 1 IAACGG 


2640 


TTTGCTACCC 


CCTCCTGCGC 


CTCCCGGCCA 


CGATTACAAG 


CTAAATAGCC 


ATCCGCGCGA 


2700 


CAAAGAAAGT 


CCCAAGAAAG 


AGCGACTAAC 


GCCAACCAAA 


AAGGATAAGC 


ACCGTCCTGT 


2760 


AATGCCCCCA 


ATGGGCAGTG 


GGAACAGTTC 


CTCCGGCTCG 


GGATCATCAA 


AGCCGATGCT 


2820 


ACCGCCTCAC 


AAGAAGCAGA 


TACCCCATGG 


CGGGGACCTG 


TTGACCAATC 


CTGGAGAGAG 


2880 


TGGAAGCCTA 


AAACGGCCCA 


ACGAGATCTC 


GGGAAGTCAG 


TATGGACTAA 


ATAAGCTGGA 


2940 


TGAAATAGAT 


AACAGTAATA 


TGCCTCGAGA 


AAAGCTTCGC 


AAGCTGGACA 


CTACAACTGG 


3000 


ACTACCAACT 


TATCCGAATT 


ATGAGGAGAA 


ACACACGCCT 


CTGAATATGT 


CCAACGGAAT 


3060 


CGAGACAACG 


CCGGATCTGG 


TGCGCAG 1 1 1 


GCTAAAGGAG 


AGTCTGTGTC 


CATCGAACGC 


3120 


TTCGCTCCTG 


AAACCGGATG 


CCTTGACTAT 


GCCTGGCCTG 


AAACCACCGG 


CCGAACTACT 


3180 


TGAGCCCATG 


CCCGCACCAG 


CGACAATCAA 


GAAAGAACAG 


GGAATAACTC 


CGATGACCAG 


3240 


TTTGGCTAGT 


GGGCCCGCAC 


CCATGGATTT 


GGAAGTACCC 


ACTAAACAGG 


CCGGAGAGAT 


3300 


TAAGGAGGAA 


AGCAGCAGCA 


AGTCCGAAAA 


GAAAAAGAAG 


AAGGATAAAC 


ACAAACACAA 


3360 
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GGAGAAGGAC 


AAGTCCAAGG 


ACAAGACGGA 


AAAGGAGGAG 


CGTAAGAAGC 


ACAAGAGGGA 


3420 


CAAGCAGAAG 


GATCGTAGCG 


GCAGCGGTGG 


CAGCAAGGAC 


AGTTCTCTTC 


CCAATGAGCC 


3480 


TCTGAAGATG 


GTTATCAAGA 


ATCCCAACGG 


CAGCCTGCAG 


GCCGGTGCGT 


CAGCTCCCAT 


3540 


TAAACTTAAG 


ATCAGCAAAA 


ATAAGGTTGA 


ACCCAATAAC 


TACTCTGCAG 


CGGCGGGTCT 


3600 


GCCTGGCGCA 


ATCGGATATG 


GCTTGCCTCC 


AACTACGGCT 


ACCACCACAT 


CCGCTTCGAT 


3660 


CGGAGCAGCT 


GCTCCTGTTC 


TGCCTCCTTA 


TGGTGCCGGC 


GGTGGTGGCT 


ACAGCTCATC 


3720 


GGGCGGCAGC 


AGTTCCGGTG 


GCAGCAGCAA 


GAAAAAGCAC 


AGCGATCGTG 


ACCGCGACAA 


3780 


GGAGAGCAAA 


AAGAATAAGA 


GCCAAGACTA 


CGCGAAGTAC 


AATGGCGCTG 


GTGGCGGCAT 


3840 


CI 1 IAATCCC 


CTTGGCGGTG 


CTGGCGCCGC 


ACCCAATATG 


TCTGGAGGAA 


TGGGCGCCCC 


3900 


CATGTCTACT 


GCTGTACCAC 


CATCCATGCT 


GTTGGCGCCC 


ACCGGTGCAG 


TACCACCCTC 


3960 


TGCCGCTGGG 


CTGGCACCGC 


CTCCCATGCC 


CGTCTACAAC 


AAGAAGTAGT 


GGTAGCGGTC 


4020 


AGAGGGTTAT 


TCTTAAGTCG 


TACG 1 1 1 1 GA 


TATATGTATA 


GAACCTCAGT 


AAGTCCGATT 


4080 


GTAGTATAGT 


TGTTAGGATT 


GTTAGTGAGA 


-r/~/~ a i i a ii/" 

TGCATTATTG 


AIM IAGTTA 


AGCACATAGA 


A 1 A f\ 

414U 


TAAAACTCCA 


AATTGGAAGT 


GAAACCGGAT 


GCGCAGATCG 


AAGAAGAATG 


GAAGTAGATG 


4200 


TCGCGATGGG 


GCTGGACGTA 


AAAGCAGTAC 


TCAAATCGCG 


AAAAC 1 1 1 1 G 


TACAGCATTA 


4260 


ATTAGTTTAT 


AACTATAATA 


AATAGCATAC 


ATATAAGCCC 


AAAAAAAAAA 


AAAAAAAAAA 


4320 


AAAAAAAA 
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(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ser Leu Leu Ala Thr Pro Met Pro Gin Ala Ala Thr Ala Ser ser 
1 5 10 15 

Ser Ser ser Ala Ser Ala Ala Ala Ser Ala Ser Gly lie Pro lie Thr 
20 25 30 

Ala Asn Asn Asn Leu Pro Phe Glu Lys Asp Lys lie Trp Tyr Phe Ser 
35 40 45 

Asn Asp Gin Leu Ala Asn Leu Pro Ser Arg Arg Cys Gly lie Lys Gly 
50 55 60 

Asp Asp Glu Leu Gin Tyr Arg Gin Met Thr Ala Tyr Leu lie Gin Glu 
65 70 75 80 

Met Gly Gin Arg Leu Gin val Ser Gin Leu Cys lie Asn Thr Ala lie 
85 90 95 
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Val Tyr Met His Arg Phe Tyr Ala Phe His Ser Phe Thr His Phe His 
100 105 110 

Arg Asn Ser Met Ala Ser Ala Ser Leu Phe Leu Ala Ala Lys val Glu 
115 120 125 

Glu Gin Pro Arg Lys Leu Glu His val lie Arg Ala Ala Asn Lys Cys 
130 135 140 

Leu Pro Pro Thr Thr Glu Gin Asn Tyr Ala Glu Leu Ala Gin Glu Leu 
145 150 155 160 

val Phe Asn Glu Asn val Leu Leu Gin Thr Leu Gly Phe Asp Val Ala 
165 170 175 

lie Asp His Pro His Thr His val val Arg Thr Cys Gin Leu Val Lys 
180 185 190 

Ala Cys Lys Asp Leu Ala Gin Thr Ser Tyr Phe Leu Ala Ser Asn Ser 
195 200 205 

Leu His Leu Thr ser Met Cys Leu Gin Tyr Arg Pro Thr val val Ala 
210 215 220 

Cys Phe Cys lie Tyr Leu Ala Cys Lys Trp ser Arg Trp Glu lie Pro 
225 230 235 240 

Gin ser Thr Glu Gly Lys His Trp Phe Tyr Tyr val Asp Lys Thr val 
245 250 255 

Ser Leu Asp Leu Leu Lys Gin Leu Thr Asp Glu Phe lie Ala lie Tyr 
260 265 270 

Glu Lys Ser Pro Ala Arg Leu Lys Ser Lys Leu Asn Ser lie Lys Ala 
275 280 285 

lie Ala Gin Gly Ala Ser Asn Arg Thr Ala Asn Ser Lys Asp Lys Pro 
290 295 300 

Lys Glu Asp Trp Lys lie Thr Glu Met Met Lys Gly Tyr His Ser Asn 
305 310 315 320 

lie Thr Thr Pro Pro Glu Leu Leu Asn Gly Asn Asp Ser Arg Asp Arg 
325 330 335 

Asp Arg Asp Arg Glu Arg Glu Arg Glu Arg Glu Arg Asp Pro Ser Ser 
340 345 350 

Leu Leu Pro Pro Pro Ala Met val Pro Gin Gin Arg Arg Gin Asp Gly 
355 360 365 

Gly His Gin Arg Ser Ser Ser val Ser Gly val Pro Gly Ser Ser Ser 
370 375 380 

Ser Ser Ser Ser Ser Ser His Lys Met Pro Asn Tyr Pro Gly Gly Met 
385 390 395 400 

Pro Pro Glu Ala His Pro Asp His Lys Ser Lys Gin Pro Gly Tyr Asn 
405 410 415 

Asn Arg Met Pro Ser Ser His Gin Arg Ser Ser ser Ser Gly Leu Gly 
420 425 430 
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Ser Ser Gly Ser Gly Ser Gin His Ser Ser Ser Ser ser Ser Ser Ser 
435 440 445 

Ser Gin Gin Pro Gly Arg Pro Ser Met Pro val Asp Tyr His Lys Ser 
450 455 460 

Ser Arg Gly Met Pro Pro val Gly val Gly Met Pro Pro His Gly Ser 
465 470 475 480 

His Lys Met Thr Ser Gly Ser Lys Pro Gin Gin Pro Gin Gin Gin Pro 
485 490 495 

Val Pro His Pro Ser Ala Ser Asn Ser Ser Ala Ser Gly Met Ser Ser 
500 505 510 

Lys Asp Lys Ser Gin Ser Asn Lys Met Tyr Pro Asn Ala Pro Pro Pro 
515 520 525 

Tyr Ser Asn Ser Ala Pro Gin Asn Pro Leu Met Ser Arg Gly Gly Tyr 
530 535 540 

Pro Gly Ala Ser Asn Gly Ser Gin Pro Pro Pro Pro Ala Gly Tyr Gly 
545 550 555 560 

Gly His Arg Ser Lys Ser Gly Ser Thr val His Gly Met Pro His Phe 
565 570 575 

Glu Gin Gin Leu Pro Tyr Ser Gin Ser Gin Ser Tyr Gly His Met Gin 
580 585 590 

Gin Gin Pro val Pro Gin Ser Gin Gin Gin Gin Met Pro Pro Glu Ala 
595 600 605 

Ser Gin His Ser Leu Gin Ser Lys Asn Ser Leu Phe Ser Pro Glu Trp 
610 615 620 

Pro Asp lie Lys Lys Glu Pro Met Ser Gin Ser Gin Pro Gin Leu Phe 
625 630 635 640 

Asn Gly Leu Leu Pro Pro Pro Ala Pro Pro Gly His Asp Tyr Lys Leu 
645 650 655 

Asn Ser His Pro Arg Asp Lys Glu Ser Pro Lys Lys Glu Arg Leu Thr 
660 665 670 

Pro Thr Lys Lys Asp Lys His Arg Pro val Met Pro Pro Met Gly Ser 
675 680 685 

Gly Asn Ser Ser Ser Gly Ser Gly Ser ser Lys Pro Met Leu Pro Pro 
690 695 700 

His Lys Lys Gin lie Pro His Gly Gly Asp Leu Leu Thr Asn Pro Gly 
705 710 715 720 

Glu Ser Gly Ser Leu Lys Arg Pro Asn Glu lie Ser Gly Ser Gin Tyr 
725 730 735 

Gly Leu Asn Lys Leu Asp Glu lie Asp Asn Ser Asn Met Pro Arg Glu 
740 745 750 

Lys Leu Arg Lys Leu Asp Thr Thr Thr Gly Leu Pro Thr Tyr Pro Asn 
755 760 765 
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Tyr Glu Glu Lys His Thr Pro Leu Asn Met Ser Asn Gly lie Glu Thr 
770 775 ' 780 

Thr Pro Asp Leu val Arg Ser Leu Leu Lys Glu Ser Leu Cys Pro Ser 
785 790 795 800 

Asn Ala Ser Leu Leu Lys Pro Asp Ala Leu Thr Met Pro Gly Leu Lys 
805 810 815 

Pro Pro Ala Glu Leu Leu Glu Pro Met Pro Ala Pro Ala Thr lie Lys 
820 825 830 

Lys Glu Gin Gly lie Thr Pro Met Thr Ser Leu Ala Ser Gly Pro Ala 
835 840 845 

Pro Met Asp Leu Glu val Pro Thr Lys Gin Ala Gly Glu lie Lys Glu 
850 • 855 860 

Glu Ser ser Ser Lys Ser Glu Lys Lys Lys Lys Lys Asp Lys His Lys 
865 870 875 880 

His Lys Glu Lys Asp Lys Ser Lys Asp Lys Thr Glu Lys Glu Glu Arg 
885 890 895 

Lys Lys His Lys Arg Asp Lys Gin Lys Asp Arg ser Gly Ser Gly Gly 
900 905 910 

Ser Lys Asp Ser Ser Leu Pro Asn Glu Pro Leu Lys Met val lie Lys 
915 920 925 

Asn Pro Asn Gly Ser Leu Gin Ala Gly Ala Ser Ala Pro lie Lys Leu 
930 935 940 

Lys lie Ser Lys Asn Lys val Glu Pro Asn Asn Tyr Ser Ala Ala Ala 
945 950 955 960 

Gly Leu Pro Gly Ala lie Gly Tyr Gly Leu Pro Pro Thr Thr Ala Thr 
965 970 975 

Thr Thr ser Ala Ser tie Gly Ala Ala Ala Pro val Leu Pro Pro Tyr 
980 985 990 

Gly Ala Gly Gly Gly Gly Tyr ser ser ser Gly Gly Ser ser Ser Gly 
995 1000 1005 

Gly Ser Ser Lys Lys Lys His Ser Asp Arg Asp Arg Asp Lys Glu Ser 
1010 1015 1020 

Lys Lys Asn Lys Ser Gin Asp Tyr Ala Lys Tyr Asn Gly Ala Gly Gly 
1025 1030 1035 1040 

Gly lie Phe Asn Pro Leu Gly Gly Ala Gly Ala Ala Pro Asn Met Ser 
1045 1050 1055 

Gly Gly Met Gly Ala Pro Met Ser Thr Ala val Pro Pro Ser Met Leu 
1060 1065 1070 

Leu Ala Pro Thr Gly Ala val Pro Pro Ser Ala Ala Gly Leu Ala Pro 
1075 1080 1085 

Pro Pro Met Pro val Tyr Asn Lys Lys 
1090 1095 
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(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 1119 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1. .1116 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 
ATG GCA AAG CAG TAC GAC TCG GTG GAG TGC CCT TTT TGT GAT GAA GTT 

Met Ala Lys Gin Tyr Asp Ser val Glu Cys Pro Phe Cys Asp Glu val 
15 10 15 

TCC AAA TAC GAG AAG CTC GCC AAG ATC GGC CAA GGC ACC TTC GGG GAG 
Ser Lys Tyr Glu Lys Leu Ala Lys lie Gly Gin Gly Thr Phe Gly Glu 
20 25 30 

GTG TTC AAG GCC AGG CAC CGC AAG ACC GGC CAG AAG GTG GCT CTG AAG 
Val Phe Lys Ala Arg His Arg Lys Thr Gly Gin Lys Val Ala Leu Lys 
35 40 45 

AAG GTG CTG ATG GAA AAC GAG AAG GAG GGG TTC CCC ATT ACA GCC TTG 
Lys Val Leu Met Glu Asn Glu Lys Glu Gly Phe Pro lie Thr Ala Leu 
50 55 60 

CGG GAG ATC AAG ATC CTT CAG CTT CTA AAA CAC GAG AAT GTG GTC AAC 
Arg Glu lie Lys lie Leu Gin Leu Leu Lys His Glu Asn Val val Asn 
65 70 75 80 

TTG ATT GAG ATT TGT CGA ACC AAA GCT TCC CCC TAT AAC CGC TGC AAG 
Leu lie Glu lie Cys Arg Thr Lys Ala Ser Pro Tyr Asn Arg Cys Lys 
85 90 95 

GGT AGT ATA TAC CTG GTG TTC GAC TTC TGC GAG CAT GAC CTT GCT GGG 

Gly Ser lie Tyr Leu val Phe Asp Phe Cys Glu His Asp Leu Ala Gly 
100 105 110 

CTG TTG AGC AAT GTT TTG GTC AAG TTC ACG CTG TCT GAG ATC AAG AGG 
Leu Leu Ser Asn Val Leu val Lys Phe Thr Leu Ser Glu lie Lys Arg 
115 120 125 

GTG ATG CAG ATG CTG CTT AAC GGC CTC TAC TAC ATC CAC AGA AAC AAG 

val Met Gin Met Leu Leu Asn Gly Leu Tyr Tyr lie His Arg Asn Lys 
130 135 140 

ATC CTG CAT AGG GAC ATG AAG GCT GCT AAT GTG CTT ATC ACT CGT GAT 

lie Leu His Arg Asp Met Lys Ala Ala Asn val Leu lie Thr Arg Asp 
145 150 155 160 

GGG GTC CTG AAG CTG GCA GAC TTT GGG CTG GCC CGG GCC TTC AGC CTG 
Gly val Leu Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Ser Leu 
165 170 175 

GCC AAG AAC AGC CAG CCC AAC CGC TAC ACC AAC CGT GTG GTG ACA CTC 

Ala Lys Asn Ser Gin Pro Asn Arg Tyr Thr Asn Arg Val Val Thr Leu 

180 185 190 
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TGG TAC CGG CCC CCG GAG CTG TTG CTC GGG GAG CGG GAC TAC GGC CCC 624 
Trp Tyr Arg Pro Pro Glu Leu Leu Leu Gly Glu Arg Asp Tyr Gly Pro 
195 200 205 

CCC ATT GAC CTG TGG GGT GCT GGG TGC ATC ATG GCA GAG ATG TGG ACC 672 
Pro lie Asp Leu Trp Gly Ala Gly Cys lie Met Ala Glu Met Trp Thr 
210 215 220 

CGC AGC CCC ATC ATG CAG GGC AAC ACG GAG CAG CAC CAA CTC GCC CTC 720 
Arg Ser Pro lie Met Gin Gly Asn Thr Glu Gin His Gin Leu Ala Leu 
225 230 235 240 

ATC AGT CAG CTC TGC GGC TCC ATC ACC CCT GAG GTG TGG CCA AAC GTG 768 
lie Ser Gin Leu Cys Gly Ser lie Thr Pro Glu val Trp Pro Asn val 
245 250 255 

GAC AAC TAT GAG CTG TAC GAA AAG CTG GAG CTG GTC AAG GGC CAG AAG 816 
Asp Asn Tyr Glu Leu Tyr Glu Lys Leu Glu Leu Val Lys Gly Gin Lys 
260 265 270 

CGG AAG GTG AAG GAC AGG CTG AAG GCC TAT GTG CGT GAC CCA TAC GCA 864 

Arg Lys val Lys Asp Arg Leu Lys Ala Tyr val Arg Asp Pro Tyr Ala 
275 280 285 

CTG GAC CTC ATC GAC AAG CTG CTG GTG CTG GAC CCT GCC CAG CGC ATC 912 

Leu Asp Leu lie Asp Lys Leu Leu val Leu Asp Pro Ala Gin Arg lie 
290 295 300 

GAC AGC GAT GAC GCC CTC AAC CAC GAC TTC TTC TGG TCC GAC CCC ATG 960 

Asp Ser Asp Asp Ala Leu Asn His Asp Phe Phe Trp Ser Asp Pro Met 
305 310 315 320 

CCC TCC GAC CTC AAG GGC ATG CTC TCC ACC CAC CTG ACG TCC ATG TTC 1008 
Pro Ser Asp Leu Lys Gly Met Leu Ser Thr His Leu Thr Ser Met Phe 
325 330 335 

GAG TAC TTG GCA CCA CCG CGC CGG AAG GGC AGC CAG ATC ACC CAG CAG 1056 

Glu Tyr Leu Ala Pro Pro Arg Arg Lys Gly Ser Gin lie Thr Gin Gin 
340 345 350 

TCC ACC AAC CAG AGT CGC AAT CCC GCC ACC ACC AAC CAG ACG GAG TTT 1104 
Ser Thr Asn Gin Ser Arg Asn Pro Ala Thr Thr Asn Gin Thr Glu Phe 
355 360 365 

GAG CGC GTC TTC TGA 1119 
Glu Arg Val Phe 
370 



(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) molecule type: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met Ala Lys Gin Tyr Asp Ser val Glu cys Pro Phe Cys Asp Glu val 
15 10 15 
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Ser Lys Tyr Glu Lys Leu Ala Lys lie Gly Gin Gly Thr Phe Gly Glu 
20 25 30 

Val Phe Lys Ala Arg His Arg Lys Thr Gly Gin Lys val Ala Leu Lys 
35 40 45 

Lys val Leu Met Glu Asn Glu Lys Glu Gly Phe Pro lie Thr Ala Leu 
50 55 60 

Arg Glu lie Lys lie Leu Gin Leu Leu Lys His Glu Asn val val Asn 
65 70 75 80 

Leu lie Glu lie Cys Arg Thr Lys Ala ser Pro Tyr Asn Arg cys Lys 
85 90 95 

Gly Ser lie Tyr Leu val Phe Asp Phe Cys Glu His Asp Leu Ala Gly 
100 105 110 

Leu Leu Ser Asn val Leu val Lys Phe Thr Leu Ser Glu lie Lys Arg 
115 120 125 

val Met Gin Met Leu Leu Asn Gly Leu Tyr Tyr lie His Arg Asn Lys 
130 135 140 

lie Leu His Arg Asp Met Lys Ala Ala Asn val Leu lie Thr Arg Asp 
145 150 155 160 

Gly val Leu Lys Leu Ala Asp Phe Gly Leu Ala Arg Ala Phe Ser Leu 
165 170 175 

Ala Lys Asn Ser Gin Pro Asn Arg Tyr Thr Asn Arg val val Thr Leu 
180 185 190 

Trp Tyr Arg Pro Pro Glu Leu Leu Leu Gly Glu Arg Asp Tyr Gly Pro 
195 200 205 

Pro lie Asp Leu Trp Gly Ala Gly Cys lie Met Ala Glu Met Trp Thr 
210 215 220 

Arg Ser Pro lie Met Gin Gly Asn Thr Glu Gin His Gin Leu Ala Leu 
225 230 235 240 

lie Ser Gin Leu cys Gly Ser lie Thr pro Glu val Trp Pro Asn val 
245 250 255 

Asp Asn Tyr Glu Leu Tyr Glu Lys Leu Glu Leu Val Lys Gly Gin Lys 
260 265 270 

Arg Lys val Lys Asp Arg Leu Lys Ala Tyr val Arg Asp Pro Tyr Ala 
275 280 285 

Leu Asp Leu lie Asp Lys Leu Leu Val Leu Asp Pro Ala Gin Arg lie 
290 295 300 

Asp Ser Asp Asp Ala Leu Asn His Asp Phe Phe Trp ser Asp Pro Met 
305 310 315 320 

Pro Ser Asp Leu Lys Gly Met Leu ser Thr His Leu Thr Ser Met Phe 
325 330 335. 

Glu Tyr Leu Ala Pro Pro Arg Arg Lys Gly Ser Gin lie Thr Gin Gin 
340 345 350 
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Ser Thr Asn Gin Ser Arg Asn Pro Ala Thr Thr Asn Gin Thr Glu Phe 
355 360 365 

Glu Arg val Phe 
370 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
ACGAATTCCA CACAATCCAA AGATC 25 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 
CAGAATTCCT ATTGCCGATC CCCAGA 26 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: one-of(8, 14) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "n = A or C or G or T" 

(ix) FEATURE: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: 12 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "Y = C or T" 

(ix) FEATURE: 

(A) name/key: modi f i ed_base 

(B) LOCATION: one-of(17, 20) 

(D) OTHER INFORMATION: /mod_base= OTHER 
/note= "R = A or 6" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

GGAATTCNAT GYTNCARCAR CC 22 
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(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) type: nucleic acid 
CO STRANDEDNESS: single 
(d) topology: linear 

(ix) feature: 

(a) name/key: modifiecLbase 

(B) LOCATION: one-of(13, 16, 19, 22, 25) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "R = A or G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
AACTGCAGTC CARAARAART CRTGRTT 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
TGTCAAGGAT CAAACCGGCT GTGAT 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 
CGAATTCCAA GAAACGCATC GATGC 



(2) INFORMATION FOR SEQ ID NO: 13: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
AGACCTGCCA AATCGTGT 



(2) INFORMATION FOR SEQ ID NO: 14: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGAAGGTGGA TCTGTAACCA TTCGT 



(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
GGAATTCAGA TCTCGATCAG ATTCA 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TTACTACTCG AGCTACCAAA CCCGGTC 



(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
TAAGCAAGCT TCTATGGCGC ACATGTCC 



(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
TTACTACTCG AGCTACCAAA CCCGGTC 



(2) INFORMATION FOR SEQ ID NO: 19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: one-of(13, 16, 22) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= 11 Y = C or T" 

(ix) FEATURE: 

(A) name/key: modi f i ed_base 
(b) location: 17 

(D) other information: /mod_base= other 
/note= n w = A or T" 

(ix) feature: 

(a) name/key: modified_base 

(B) location: 18 

(D) other information: /mod_base= other 
/note= "s = c or G" 

(ix) feature: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: 19 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = A or C or G or T" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GGAATTCTGG TAYTTYWSNA AYGA 



(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: 11 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "y = C or T" 

(ix) FEATURE: 

(A) name/key: modi f i ed_base 

(B) location: 14 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "R = A or G" 

(ix) FEATURE: 

(A) name/key: modified_base 

(B) LOCATION: one-of(17, 20) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= n N = A or C or G or t" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
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(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ix) FEATURE: 

(A) name/key: modi f i ecLbase 

(B) LOCATION: one-of(ll, 14, 20) 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "N = A or C or G or T" 

(ix) FEATURE: 

(A) NAME/KEY: modi f i ed_base 

(B) LOCATION: 23 

(D) OTHER INFORMATION: /mod_base= OTHER 

/note= "R = A or G" 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
CGGGATCCAA NGGNGGCATN CCRT 



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATCACGACAC CACCAGAGCT GTTA 



(2) INFORMATION FOR SEQ ID NO: 23: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base paiirs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGAATTCAGA TCGTGAACGG GA 



(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 

CGAATTCAGG CGCTAGCAAT G 



(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) length: 17 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
GAAAGGCGTA GAACCGA 



(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 
GCTGACCCAT TTCCTGTATC AGATAG 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
GGAATTCTTC TGCTTGGCGA AT 



(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 

GGGAATTCGA GGTTCTATAC ATAT 



(2) INFORMATION FOR SEQ ID NO:29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTGTGTGAAT GGAATCTGTG ATGTG 25 

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TATCCCGGGT CATATGAGTC TCCTAGCC 28 

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 

Met Leu Gin Gin Pro Ser Gly Ser Thr Pro Ser Asn val 
1 5 10 

(2) INFORMATION FOR SEQ ID NO: 32: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

Ala Asp Thr Ala Leu Asn His Asp Phe Phe Trp Thr Asp Pro Met Pro 
15 10 15 

Ser 



(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33: 

Met Leu Gin Gin Pro 
1 5 
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(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

Asn His Asp Phe Phe Trp Thr 
1 5 



(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

Ser Pro Glu Trp Pro Asp lie 
1 5 



(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36: 

Trp Tyr Phe Ser Asn Asp Gin Leu Ala Asn Ser Pro Ser Arg 
15 10 



(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 

Thr Val His Gly Met Pro Pro Phe Glu Gin Gin Leu Pro Tyr 
15 10 



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 
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(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38: 

Trp Tyr Phe Ser Asn Asp 
1 5 



(2) INFORMATION FOR SEQ ID NO: 39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39: 

Met Pro Pro Phe Glu Gin 
1 5 



(2) INFORMATION FOR SEQ ID NO: 40: 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 6 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY:* linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

His Gly Met Pro Pro Phe 
1 5 



(2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 
GCAGGATCCA GAATTCCATA TGGCAAAGCA GTACGACTCG G 



(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
CAGTACTCGA GTTATCAGAA GACGCGCTCA AAC 
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(2) INFORMATION FOR SEQ ID NO: 43: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4528 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 



GGGGGGGGGG 


GGGTGAATGA 


AGGAGCGGGC 


GGAGGAGGAA 


TTGTCATGGC 


GTCGGGCCGT 


60 


GGAGCTTCTT 


CTCGCTGGTT 


CI 1 IACTCGG 


GAACAGCTGG 


AGAACACGCC 


GAGCCGCCGC 


120 


TGCGGAGTGG 


AGGCGGATAA 


AGAGCTCTCG 


TGCCGCCAGC 


AGGCGGCCAA 


CCTCATCCAG 


180 


GAGATGGGAC 


AGCGTCTCAA 


TGTCTCTCAG 


CTTACAATAA 


ACACTGCGAT 


TGTTTATATG 


240 


CACAGG 1 1 1 1 


ATATGCACCA 


1 ILI 1 ICACC 


AAATTCAACA 


AAAATATAAT 


ATCGTCTACT 


300 


GCATTAI 1 1 1 


TGGCTGCAAA 


AGTGGAAGAA 


CAGGCTCGAA 


AACTTGAACA 


TGTTATCAAA 


360 


GTAGCACATG 


CTTGTCTTCA 


TCCTCTAGAG 


CCACTGCTGG 


ATACTAAATG 


TGATGCTTAC 


420 


CTTCAACAGA 


CTCAAGAACT 


GGTTATACTT 


GAAACCATAA 


TGCTACAAAC 


TCTAGGI 1 1 1 


480 


GAGATCACCA 


TTGAACACCC 


ACACACAGAT 


GTGGTGAAAT 


GTACCCAGTT 


AGTAAGAGCA 


540 


AGCAAGGATT 


TGGCACAGAC 


ATCCTATTTC 


ATGGCTACCA 


ACAGTCTGCA 


TCTTACAACC 


600 


TTCTGTCTTC 


AGTACAAACC 


AACAGTGATA 


GCATGTGTAT 


GCATTCATTT 


GGCTTGCAAA 


660 


TGGTCCAATT 


GGGAGATCCC 


TGTATCAACT 


GATGGAAAGC 


ATTGGTGGGA 


ATATGTGGAT 


720 


CCTACAGTTA 


CTCTAGAATT 


ATTAGATGAG 


CTAACACATG 


AGTTTCTACA 


AATATTGGAG 


780 


AAAACGCCTA 


ATAGGTTGAA 


GAAGATTCGA 


AACTGGAGGG 


CTAATCAGGC 


AGCTAGGAAA 


840 


CCAAAAGTAG 


ATGGACAGGT 


ATCAGAGACA 


CCACTTCTTG 


GTTCATC 1 1 1 


GGTCCAGAAT 


900 


TCCAI 1 1 1 AG 


TAGATAGTGT 


CACTGGTGTG 


CCTACAAACC 


CAAG 1 1 1 1 CA 


GAAACCATCT 


960 


ACATCAGCAT 


TCCCTGCGCC 


AGTACCTCTA 


AATTCAGGAA 


ATAI 1 ICIGI 


TCAAGACAGC 


1020 


CATACATCTG 


ATAAI I IGIC 


AATGCTAGCA 


ACAGGAATGC 


CAAGTACTTC 


ATACGGTTTA 


1080 


TCATCACACC 


AGGAATGGCC 


TCAACATCAA 


GACTCAGCAA 


GGACAGAACA 


GCTATATTCA 


1140 


CAGAAACAGG 


AGACATC 1 1 1 


GTCTGGTAGC 


CAGTACAACA 


TCAACTTCCA 


GCAGGGACCT 


1200 


TCTATATCAC 


TGCATTCAGG 


ATTACATCAC 


AGACCTGACA 


AAATTTCAGA 


TCATTCTTCT 


1260 


GTTAAGCAAG 


AATATACTCA 


TAAAGCAGGG 


AGCAGTAAAC 


ACCATGGGCC 


AATTTCCACT 


1320 


ACTCCAGGAA 


TAATTCCTCA 


GAAAATGTCT 


TTAGATAAAT 


ATAGAGAAAA 


GCGTAAACTA 


1380 


GAAACTCTTG 


ATCTCGATGT 


AAGGGATCAT 


TATATAGCTG 


CCCAGGTAGA 


ACAGCAGCAC 


1440 


AAACAAGGGC 


AGTCACAGGC 


AGCCAGCAGC 


AGTTCTGTTA 


CTTCTCCCAT 


TAAAATGAAA 


1500 


ATACCTATCG 


CAAATACTGA 


AAAATACATG 


GCAGATAAAA 


AGGAAAAGAG 


TGGGTCACTG 


1560 


AAATTACGGA 


TTCCAATACC 


ACCCACTGAT 


AAAAGCGCCA 


GTAAAGAAGA 


ACTGAAAATG 


1620 
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AAAATAAAAG 


1 1 ICI IC! IC 


AGAAAGACAC 


AGCTCTTCTG 


ATGAAGGCAG 


TGGGAAAAGC 


1680 


AAACATTCAA 


GCCCACATAT 


TAGCAGAGAC 


CATAAGGAGA 


AGCACAAGGA 


GCATCCTTCA 


1740 


AGCCGCCACC 


ACACCAGCAG 


CCACAAGCAT 


TCCCACTCGC 


ATAGTGGCAG 


CAGCAGCGGT 


1800 


GGCAGTAAAC 


ACAGTGCCGA 


CGGAATACCA 


CCCACTGTTC 


TGAGGAGTCC 


TGTTGGCCTG 


1860 


AGCAGTGATG 


GCATTTCCTC 


TAGCTCCAGC 


TCTTCAAGGA 


AGAGGCTGCA 


TGTCAATGAT 


1920 


GCATCTCACA 


ACCACCACTC 


CAAAATGAGC 


AAAAGTTCCA 


AAAGTTCAGG 


TGGGCTACGG 


1980 


ACATCTCAGC 


ACCTCGTGAA 


ACTGGACAAG 


AAGCCAGTGG 


AGACCAACGG 


TCCTGATGCC 


2040 


AATCACGAGT 


ACAGTACAAG 


CAGCCAGCAT 


ATGGACTACA 


AAGACACATT 


CGACATGCTG 


2100 


GACTCACTGT 


TAAGTGCCCA 


AGGAATGAAC 


ATGTAATAAT 


1 IGI 1 IAGGT 


CAAI 1 1 1 ICC 


2160 


TTTAC 1 1 1 1 1 


TAATTTAAAA 


ATTGTTAGAA 


TGGAAAAATT 


CCTTCTGATC 


TAGCAGTGGT 


2220 


AACCCCTGCT 


GTTGCTGCCA 


CTGCTTCAAT 


ATTTGTAAGT 


GCTACI 1 1 AT 


TCTTCATTCT 


2280 


GAAAAGAAGA 


GATTATAGTA 


AACAAGTCTT 


TATCTCCACA 


TATGATAGTG 


TTATAAATAC 


2340 


TGTAAAGGCA 


TGGAAGGTGC 


AAAACTCAGT 


Al 1 ICIACAA 


TTGCAGCTAA 


GAACATTAGG 


2400 


ATGAATGGCT 


GGCTGCTTCT 


AGGAATATAA 


GATGCCTCAA 


GCATTCATTA 


TTTATGATTT 


2460 


GAATACTGTA 


GCTAI Mill 


GTTGCTTGGC 


1 1 1 1 GAATGA 


GTGTAAATTG 


1 1 1 ICI 1 1 IG 


2520 


TGTATTTATA 


CTTGTATGTA 


TGATTTGCAT 


Gill CAATGA 


TAAAGGGATA 


AAACAGTATA 


2580 


CTGACAACTG 


TTTACAAGAA 


AGTGGAGAAA 


ATGTACTACA 


1 1 1 IGIATGT 


TTAGATATTA 


2640 


CCGTAAATAC 


TCAGGATTGG 


AGCTGCTTGT 


AAGTATAACA 


ATATACAGAA 


TACI 1 IATTT 


2700 


TATCTTGTCA 


GAGTTCCATC 


ACTATCTAAA 


ACAAAGGTGC 


AA 1 1 1 1 1 1 AT 


GTTAACCTTA 


2760 


AATCTAGCCC 


TTACTGGAAG 


CCACTGATAG 


GGACATTCAC 


TACCAGATGT 


GTGCAGTGCA 


2820 


GCAGATGGTC 


ATATAACACT 


GTGAGGCACT 


GAAI 1 1 IGCC 


TTCAGAGGTT 


CTGACCAGAT 


2880 


TGGCTGCTGA 


AATAGCCCCT 


AAC 1 1 1 C 1 GA 


AGGCTTGAAG 


AGGaAAAAAT 


AAAGTTTACA 


2940 


TACTCTTGAT 


GGAAGTGCAT 


TTAAATG 1 1 1 


GTTGGCTTGT 


TGCAGTTCTA 


TGAAACAGAG 


3000 


CTGTTAATAA 


TGGTTATGTG 


GATTACTGTG 


ATTTGAAAAC 


TAAATTCACA 


ATAACTTACC 


3060 


TAGTAGAGAT 


TTAGTGAGTT 


Gl 1 ICCI 1 IA 


AAGAAI 1 1 IA 


CACTACATAT 


TTTAATAGTA 


3120 


AACAGGGTCA 


CI 1 ICCI 1 IA 


GCATTCAGAA 


TGACACCATA 


TTCTTAAATA 


TACTCCTTCC 


3180 


V. 1 VjAAuL.U 1 V3 


1 1 lulululVi 


A 1 uLLA 1 A 1 1 


1 V. 1 II II V_A<J 


/ — r A a AX/ — TA f 
\3 1 AAA 1 \3 1 Avj 


Tt — 1 — T(~ t — T — T A T" 
1 v_ 1 ■ MAI 




AAAAATGAAA 


TTAAACCTAT 


GCTCTCAATT 


CI 1 1 IATATT 


CTAACAATAA 


ATAAAAAAGA 


3300 


AAAGATTACT 


GACTGTGCAT 


TGTACCTGTA 


TTTATAGTTT 


ATGGTTATCA 


GAAGCTCTGT 


3360 


AAGAAAGAAA 


AGGTCAGCTC 


CCAGGCAAAC 


CAGTAGTGGA 


GGTTTTACAT 


1 IGI 1 1 GCAC 


3420 


ATCTCAGTAT 


Al 1 1CIGI IG 


AGGTAAAGTT 


TGCACAGTCA 


TCTGACTTCT 


GATCAAGCAT 


3480 
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TAGAI 1 1 IAA 


CI IGI 1 1 AGA 


1 1 1 IGICI IA 


AACACCAGTA 


ATATGGCTCT 


TGTTTATCAG 


3540 


CTAATCTTGA 


ATTTATTCTG 


TGGTAAATCT 


TTTGAGTTGC 


TGAGTATATT 


TGAGATTGAT 


3600 


TGGATTCAAC 


CTCTTGTTGA 


ACTGAAAACT 


T AA MIMIC 


TCTGTA 1 1 1 1 


TGTTACAAAG 


3660 


CCACTGATAC 


GTGCACAATT 


GTAATTAAGT 


ATGTTGCAGT 


TGTAAATATT 


AGAGTTTAAT 


3720 


CTCATGCTCT 


ACU MAIM 


AGCAATTACC 


TAATTTGCCA 


GTAGCI 1 1 AT 


AAM II 1 AAA 


3780 


GATAATTGTT 


CATTA 1 1 II G 


TCAATGTTAT 


TTGAACTTGG 


GGTACTTAGG 


AGCC 1 C 1 1 1 G 


3840 


TAGGGACTGT 


GCCTAGGTAG 


CATGTCCTAA 


CAI 1 IGI ICI 


GGTCTTGCAT 


AACTTCAGTA 


3900 


TCTTTGTCAT 


TATATGTAAC 


1 II G II GCTC 


TGTATGGCAT 


AATATTGTAT 


CCATAAACAT 


3960 


GGTAA 1 1 1 1 G 


ATACAGTTAT 


AC 1 II 1 ACAG 


TGGTACATAA 


TCCAAGGACT 


AGTATAGAAT 


4020 


TAAGCTGAGT 


GCAAGATGAG 


GGAGGGAAGG 


GCI 1 ICI IGG 


TAATTTAGAT 


GTGAAACCTC 


4080 


TACAGAGCTA 


TCATGTAAAA 


ACTACATGAG 


GTGGTTGTGC 


TACTGTATAA 


TTGGGGGTGA 


4140 


TAATACCAGG 


AAI II IAATA 


AGA 1 1 1 1 G 1 A 


AAGAATATCC 


AGAAAAGTAG 


TGAACTTATT 


4200 


TTCAGTAGGC 


ATAGAAAACA 


ATGTGAATAT 


TTAAGGTCTG 


TGACTATAGT 


TAAACTTCAC 


4260 


TAAGAATTTG 


CAGAATTGTT 


TTGAGATGTG 


TGAATAAAGG 


TAAI 1 1 1 ATT 


GAATCTTCAT 


4320 


TGGTGCTAAT 


GTTGGACAGT 


TAAAAAGATA 


GCTAGTGTAT 


ATTGTTATGG 


GTCAGTACTT 


4380 


ATTAGTACTT 


CCAAAATTGA 


ATTTGAAATG 


CTATGTATTC 


ACI 1 1 ICACT 


CTGTAAATGT 


4440 


AATTCTTTAC 


AATGACI 1 IA 


TTTATTAAAG 


GGCAGCCAGT 


TGTCAI 1 IGI 


AAAAAAAAAA 


4500 


AAAAAAAAAA 


AAAGCGGCCG 


CTGAATTC 








4528 



(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2091 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATGGCGTCGG GCCGTGGAGC TTCTTCTCGC TGGTTCTTTA CTCGGGAACA GCTGGAGAAC 60 

ACGCCGAGCC GCCGCTGCGG AGTGGAGGCG GATAAAGAGC TCTCGTGCCG CCAGCAGGCG 120 

GCCAACCTCA TCCAGGAGAT GGGACAGCGT CTCAATGTCT CTCAGCTTAC AATAAACACT 180 

GCGATTGTTT ATATGCACAG GTTTTATATG CACCATTCTT TCACCAAATT CAACAAAAAT 240 

ATAATATCGT CTACTGCATT All II IGGCT GCAAAAGTGG AAGAACAGGC TCGAAAACTT 300 

GAACATGTTA TCAAAGTAGC ACATGCTTGT CTTCATCCTC TAGAGCCACT GCTGGATACT 360 

AAATGTGATG CTTACCTTCA ACAGACTCAA GAACTGGTTA TACTTGAAAC CATAATGCTA 420 

CAAACTCTAG GTTTTGAGAT CACCATTGAA CACCCACACA CAGATGTGGT GAAATGTACC 480 
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CAGTTAGTAA 


GAGCAAGCAA 


GGATTTGGCA 


CAGACATCCT 


ATTTCATGGC 


TACCAACAGT 


540 


CTGCATCTTA 


CAACCTTCTG 


TCTTCAGTAC 


AAACCAACAG 


TGATAGCATG 


TGTATGCATT 


600 


CATTTGGCTT 


GCAAATGGTC 


CAATTGGGAG 


ATCCCTGTAT 


CAACTGATGG 


AAAGCATTGG 


660 


TGGGAATATG 


TGGATCCTAC 


AGTTACTCTA 


GAATTATTAG 


ATGAGCTAAC 


ACATGAGTTT 


720 


CTACAAATAT 


TGGAGAAAAC 


GCCTAATAGG 


TTGAAGAAGA 


TTCGAAACTG 


GAGGGCTAAT 


780 


CAGGCAGCTA 


GGAAACCAAA 


AGTAGATGGA 


CAGGTATCAG 


AGACACCACT 


TCTTGGTTCA 


840 


TCI 1 IGGTCC 


AGAATTCCAT 


TTTAGTAGAT 


AGTGTCACTG 


GTGTGCCTAC 


AAACCCAAGT 


900 


TTTCAGAAAC 


CATCTACATC 


AGCATTCCCT 


GCGCCAGTAC 


CTCTAAATTC 


AGGAAATATT 


960 


TCTGTTCAAG 


ACAGCCATAC 


ATCTGATAAT 


TTGTCAATGC 


TAGCAACAGG 


AATGCCAAGT 


1020 


ACTTCATACG 


Gl 1 IATCATC 


ACACCAGGAA 


TGGCCTCAAC 


ATCAAGACTC 


AGCAAGGACA 


1080 


GAACAGCTAT 


ATTCACAGAA 


ACAGGAGACA 


TCI 1 IGICIG 


GTAGCCAGTA 


CAACATCAAC 


1140 


TTCCAGCAGG 


GACCTTCTAT 


ATCACTGCAT 


TCAGGATTAC 


ATCACAGACC 


TGACAAAATT 


1200 


TCAGATCATT 


CTTCTGTTAA 


GCAAGAATAT 


ACTCATAAAG 


CAGGGAGCAG 


TAAACACCAT 


1260 


GGGCCAATTT 


CCACTACTCC 


AGGAATAATT 


CCTCAGAAAA 


TGI CI 1 1 AGA 


TAAATATAGA 


1320 


GAAAAGCGTA 


AACTAGAAAC 


TCTTGATCTC 


GATGTAAGGG 


ATCATTATAT 


AGCTGCCCAG 


1380 


GTAGAACAGC 


AGCACAAACA 


AGGGCAGTCA 


CAGGCAGCCA 


GCAGCAGTTC 


TGTTACTTCT 


1440 


CCCATTAAAA 


TGAAAATACC 


TATCGCAAAT 


ACTGAAAAAT 


ACATGGCAGA 


TAAAAAGGAA 


1500 


AAGAGTGGGT 


CACTGAAATT 


ACGGATTCCA 


ATACCACCCA 


CTGATAAAAG 


CGCCAGTAAA 


1560 


GAAGAACTGA 


AAATGAAAAT 


AAAAG 1 1 1 C 1 


TCTTCAGAAA 


GACACAGCTC 


TTCTGATGAA 


1620 


GGCAGTGGGA 


AAAGCAAACA 


TTCAAGCCCA 


CATATTAGCA 


GAGACCATAA 


GGAGAAGCAC 


1680 


AAGGAGCATC 


CTTCAAGCCG 


CCACCACACC 


AGCAGCCACA 


AGCATTCCCA 


CTCGCATAGT 


1740 


GGCAGCAGCA 


GCGGTGGCAG 


TAAACACAGT 


GCCGACGGAA 


TACCACCCAC 


TGTTCTGAGG 


1800 


AGTCCTGTTG 


GCCTGAGCAG 


TGATGGCATT 


TCCTCTAGCT 


CCAGCTCTTC 


AAGGAAGAGG 


1860 


CTGCATGTCA 


ATGATGCATC 


TCACAACCAC 


CACTCCAAAA 


TGAGCAAAAG 


TTCCAAAAGT 


1920 


TCAGGTGGGC 


TACGGACATC 


TCAGCACCTC 


GTGAAACTGG 


ACAAGAAGCC 


AGTGGAGACC 


1980 


AACGGTCCTG 


ATGCCAATCA 


CGAGTACAGT 


ACAAGCAGCC 


AGCATATGGA 


CTACAAAGAC 


2040 


ACATTCGACA 


TGCTGGACTC 


ACTGTTAAGT 


GCCCAAGGAA 


TGAACATGTA 


A 


2091 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Ala Ser Gly Arg Gly Ala Ser ser Arg Trp Phe Phe Thr Arg Glu 
15 10 15 

Gin Leu Glu Asn Thr Pro Ser Arg Arg Cys Gly val Glu Ala Asp Lys 
20 25 30 

Glu Leu ser cys Arg Gin Gin Ala Ala Asn Leu lie Gin Glu Met Gly 
35 40 45 

Gin Arg Leu Asn val Ser Gin Leu Thr lie Asn Thr Ala lie val Tyr 
50 55 60 

Met His Arg Phe Tyr Met His His Ser Phe Thr Lys Phe Asn Lys Asn 
65 70 75 80 

lie lie Ser Ser Thr Ala Leu Phe Leu Ala Ala Lys val Glu Glu Gin 
85 90 95 

Ala Arg Lys Leu Glu His val lie Lys val Ala His Ala cys Leu His 
100 105 110 

Pro Leu Glu Pro Leu Leu Asp Thr Lys Cys Asp Ala Tyr Leu Gin Gin 
115 120 125 

Thr Gin Glu Leu val lie Leu Glu Thr lie Met Leu Gin Thr Leu Gly 
130 135 140 

Phe Glu lie Thr lie Glu His Pro His Thr Asp val val Lys Cys Thr 
145 150 155 160 

Gin Leu Val Arg Ala Ser Lys Asp Leu Ala Gin Thr Ser Tyr Phe Met 
165 170 175 

Ala Thr Asn Ser Leu His Leu Thr Thr Phe Cys Leu Gin Tyr Lys Pro 
180 185 190 

Thr val lie Ala Cys val Cys lie His Leu Ala Cys Lys Trp Ser Asn 
195 200 205 

Trp Glu lie Pro val Ser Thr Asp Gly Lys His Trp Trp Glu Tyr val 
210 215 220 

Asp Pro Thr Val Thr Leu Glu Leu Leu Asp Glu Leu Thr His Glu Phe 
225 230 235 240 

Leu Gin lie Leu Glu Lys Thr Pro Asn Arg Leu Lys Lys lie Arg Asn 
245 250 255 

Trp Arg Ala Asn Gin Ala Ala Arg Lys Pro Lys Val Asp Gly Gin val 
260 265 270 

Ser Glu Thr Pro Leu Leu Gly Ser ser Leu val Gin Asn ser lie Leu 
275 280 285 

val Asp ser val Thr Gly Val Pro Thr Asn Pro Ser Phe Gin Lys Pro 
290 295 300 

Ser Thr Ser Ala Phe Pro Ala Pro Val Pro Leu Asn ser Gly Asn lie 
305 310 315 320 

Ser Val Gin Asp Ser His Thr Ser Asp Asn Leu Ser Met Leu Ala Thr 
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325 330 335 

Gly Met Pro Ser Thr ser Tyr Gly Leu Ser ser His Gin Glu Trp Pro 
340 345 350 

Gin His Gin Asp Ser Ala Arg Thr Glu Gin Leu Tyr Ser Gin Lys Gin 
355 360 365 

Glu Thr Ser Leu Ser Gly Ser Gin Tyr Asn lie Asn Phe Gin Gin Gly 
370 375 380 

Pro Ser lie Ser Leu His Ser Gly Leu His His Arg Pro Asp Lys lie 
385 390 395 400 

Ser Asp His Ser Ser val Lys Gin Glu Tyr Thr His Lys Ala Gly Ser 
405 410 415 

Ser Lys His His Gly Pro lie Ser Thr Thr Pro Gly lie lie Pro Gin 
420 425 430 

Lys Met Ser Leu Asp Lys Tyr Arg Glu Lys Arg Lys Leu Glu Thr Leu 
435 440 445 

Asp Leu Asp val Arg Asp His Tyr lie Ala Ala Gin val Glu Gin Gin 
450 455 460 

His Lys Gin Gly Gin Ser Gin Ala Ala Ser Ser Ser Ser val Thr Ser 
465 470 475 480 

Pro lie Lys Met Lys lie Pro lie Ala Asn Thr Glu Lys Tyr Met Ala 
485 490 495 

Asp Lys Lys Glu Lys Ser Gly Ser Leu Lys Leu Arg lie Pro lie Pro 
500 505 510 

Pro Thr Asp Lys Ser Ala Ser Lys Glu Glu Leu Lys Met Lys lie Lys 
515 520 525 

val Ser Ser Ser Glu Arg His Ser Ser ser Asp Glu Gly Ser Gly Lys 
530 535 540 

Ser Lys His Ser Ser Pro His lie Ser Arg Asp His Lys Glu Lys His 
545 550 555 560 

Lys Glu His Pro Ser Ser Arg His His Thr Ser Ser His Lys His Ser 
565 570 575 

His Ser His Ser Gly Ser ser ser Gly Gly Ser Lys His Ser Ala Asp 
580 585 590 

Gly lie Pro Pro Thr val Leu Arg Ser Pro val Gly Leu Ser ser Asp 
595 600 605 

Gly lie Ser Ser Ser Ser ser ser Ser Arg Lys Arg Leu His val Asn 
610 615 620 

Asp Ala Ser His Asn His His Ser Lys Met Ser Lys Ser Ser Lys Ser 
625 630 635 640 

Ser Gly Gly Leu Arg Thr ser Gin His Leu val Lys Leu Asp Lys Lys 
645 650 655 

Pro val Glu Thr Asn Gly Pro Asp Ala Asn His Glu Tyr ser Thr Ser 
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660 665 670 

Ser Gin His Met Asp Tyr Lys Asp Thr Phe Asp Met Leu Asp Ser Leu 
675 680 685 

Leu Ser Ala Gin Gly Met Asn Met 
690 695 

(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2190 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 



ATGGCGTCGG 


GCCGTGGAGC 


1 ILI 1 C 1 CGC 


TGG 1 1 C 1 1 1 A 


CTCGGGAACA 


GCTGGAGAAC 


60 


ACGCCGAGCC 


GCCGCTGCGG 


AGTGGAGGCG 


GATAAAGAGC 


TCTCGTGCCG 


CCAGCAGGCG 


120 


GCCAACCTCA 


TCCAGGAGAT 


GGGACAGCGT 


CTCAATGTCT 


CTCAGCTTAC 


AATAAACACT 


180 


GCGA 1 1 G 1 I 1 


ATATGCACAG 


Gl 1 1 IATATG 


CACCATTCTT 


TCACCAAATT 


CAACAAAAAT 


240 


ATAATATCGT 


CTACTGCATT 


A 1 1 1 1 1 GGCT 


GCAAAAGTGG 


AAGAACAGGC 


TCGAAAACTT 


300 


GAACATGTTA 


TCAAAGTAGC 


ACATGCTTGT 


CTTCATCCTC 


TAGAGCCACT 


GCTGGATACT 


360 


AAATGTGATG 


CTTACCTTCA 


ACAGACTCAA 


GAACTGGTTA 


TACTTGAAAC 


CATAATGCTA 


420 


CAAACTCTAG 


Gill IGAGAT 


CACCATTGAA 


CACCCACACA 


CAGATGTGGT 


GAAATGTACC 


480 


CAGTTAGTAA 


GAGCAAGCAA 


GGATTTGGCA 


CAGACATCCT 


ATTTCATGGC 


TACCAACAGT 


540 


CTGCATCTTA 


CAACCTTCTG 


TCTTCAGTAC 


AAACCAACAG 


TGATAGCATG 


TGTATGCATT 


600 


CATTTGGCTT 


GCAAATGGTC 


CAATTGGGAG 


ATCCCTGTAT 


CAACTGATGG 


AAAGCATTGG 


660 


TGGGAATATG 


TGGATCCTAC 


AGTTACTCTA 


GAATTATTAG 


ATGAGCTAAC 


ACATGAG 1 1 1 


720 


CTACAAATAT 


TGGAGAAAAC 


GCCTAATAGG 


TTGAAGAAGA 


TTCGAAACTG 


GAGGGCTAAT 


780 


CAGGCAGCTA 


GGAAACCAAA 


AGTAGATGGA 


CAGGTATCAG 


AGACACCACT 


TCTTGGTTCA 


840 


TC 1 1 IGGTCC 


AGAATTCCAT 


TTTAGTAGAT 


AGTGTCACTG 


GTGTGCCTAC 


AAACCCAAGT 


900 


TTTCAGAAAC 


CATCTACATC 


AGCATTCCCT 


GCGCCAGTAC 


CTCTAAATTC 


AGGAAATATT 


960 


TCTGTTCAAG 


ACAGCCATAC 


ATCTGATAAT 


TTGTCAATGC 


TAGCAACAGG 


AATGCCAAGT 


1020 


ACTTCATACG 


Gl 1 IATCATC 


ACACCAGGAA 


TGGCCTCAAC 


ATCAAGACTC 


AGCAAGGACA 


1080 


GAACAGCTAT 


ATTCACAGAA 


ACAGGAGACA 


TC 1 1 1 G 1 C 1 G 


GTAGCCAGTA 


CAACATCAAC 


1140 


TTCCAGCAGG 


GACCTTCTAT 


ATCACTGCAT 


TCAGGATTAC 


ATCACAGACC 


TGACAAAATT 


1200 


TCAGATCATT 


CTTCTGTTAA 


GCAGGAATAT 


ACTCATAAAG 


CAGGGAGCAG 


TAAACACCAT 


1260 


GGGCCAATTT 


CCACTACTCC 


AGGAATAATT 


CCTCAGAAAA 


TGI CI 1 1AGA 


TAAATATAGA 


1320 
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GAAAAGCGTA 


AACTAGAAAC 


TCTTGATCTC 


GATGTAAGGG 


ATCATTATAT 


AGCTGCCCAG 


1380 


GTAGAACAGC 


AGCACAAACA 


AGGGCAGTCA 


CAGGCAGCCA 


GCAGCAGTTC 


TGTTACTTCT 


1440 


CCCATTAAAA 


TGAAAATACC 


TATCGCAAAT 


ACTGAAAAAT 


ACATGGCAGA 


TAAAAAGGAA 


1500 


AAGAGTGGGT 


CACTGAAATT 


ACGGATTCCA 


ATACCACCCA 


CTGATAAAAG 


CGCCAGTAAA 


1560 


GAAGAACTGA 


AAATGAAAAT 


AAAAG 1 1 1 C 1 


TCTTCAGAAA 


GACACAGCTC 


TTCTGATGAA 


1620 


GGCAGTGGGA 


AAAGCAAACA 


TTCAAGCCCA 


CATATTAGCA 


GAGACCATAA 


GGAGAAGCAC 


1680 


AAGGAGCATC 


CTTCAAGCCG 


CCACCACACC 


AGCAGCCACA 


AGCATTCCCA 


CTCGCATAGT 


1740 


GGCAGCAGCA 


GCGGTGGCAG 


TAAACACAGT 


GCCGACGGAA 


TACCACCCAC 


TGTTCTGAGG 


1800 


AGTCCTGTTG 


GCCTGAGCAG 


TGATGGCATT 


TCCTCTAGCT 


CCAGCTCTTC 


AAGGAAGAGG 


1860 


CTGCATGTCA 


ATGATGCATC 


TCACAACCAC 


CACTCCAAAA 


TGAGCAAAAG 


TTCCAAAAGT 


1920 


TCAGGTAGTT 


CATCTAGTTC 


TTCCTCCTCT 


GTTAAGCAGT 


ATATATCCTC 


TCACAACTCT 


1980 


G 1 1 1 1 1 AACC 


ATCCCTTACC 


CCTCCTCCCC 


TGTCACATAC 


CAGGTGGGCT 


ACGGACATCT 


2040 


CTGCACCTCG 


TGAAACTGGA 


CAAGAAGCCA 


GTGGAGACCA 


ACGGTCCTGA 


TGCCAATCAC 


2100 


GAGTACAGTA 


CAAGCAGCCA 


GCATATGGAC 


TACAAAGACA 


CATTCGACAT 


GCTGGACTCA 


2160 


CTGTTAAGTG 


CCCAAGGAAT 


GAACATGTAA 








2190 



(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 

Met Ala Ser Gly Arg Gly Ala Ser Ser Arg Trp Phe Phe Thr Arg Glu 
15 10 15 

Gin Leu Glu Asn Thr Pro Ser Arg Arg Cys Gly Val Glu Ala Asp Lys 
20 25 30 

Glu Leu Ser Cys Arg Gin Gin Ala Ala Asn Leu lie Gin Glu Met Gly 
35 40 45 

Gin Arg Leu Asn val Ser Gin Leu Thr lie Asn Thr Ala lie val Tyr 
50 55 60 

Met His Arg Phe Tyr Met His His Ser Phe Thr Lys Phe Asn Lys Asn 
65 70 75 80 

lie He Ser ser Thr Ala Leu Phe Leu Ala Ala Lys val Glu Glu Gin 
85 90 95 

Ala Arg Lys Leu Glu His val lie Lys val Ala His Ala Cys Leu His 
100 105 110 

Pro Leu Glu Pro Leu Leu Asp Thr Lys Cys Asp Ala Tyr Leu Gin Gin 
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115 120 125 

Thr Gin Glu Leu val lie Leu Glu Thr lie Met Leu Gin Thr Leu Gly 
130 135 140 

Phe Glu lie Thr lie Glu His Pro His Thr Asp val Val Lys Cys Thr 
145 150 155 160 

Gin Leu Val Arg Ala Ser Lys Asp Leu Ala Gin Thr Ser Tyr Phe Met 
165 170 175 

Ala Thr Asn Ser Leu His Leu Thr Thr Phe Cys Leu Gin Tyr Lys Pro 
180 185 190 

Thr val lie Ala Cys val Cys He His Leu Ala Cys Lys Trp Ser Asn 
195 200 205 

Trp Glu lie Pro val Ser Thr Asp Gly Lys His Trp Trp Glu Tyr val 
210 215 220 

Asp Pro Thr val Thr Leu Glu Leu Leu Asp Glu Leu Thr His Glu Phe 
225 230 235 240 

Leu Gin lie Leu Glu Lys Thr Pro Asn Arg Leu Lys Lys lie Arg Asn 
245 250 255 

Trp Arg Ala Asn Gin Ala Ala Arg Lys Pro Lys val Asp Gly Gin val 
260 265 270 

Ser Glu Thr Pro Leu Leu Gly Ser Ser Leu val Gin Asn Ser lie Leu 
275 280 285 

val Asp Ser val Thr Gly val Pro Thr Asn Pro Ser Phe Gin Lys Pro 
290 295 300 

Ser Thr Ser Ala Phe Pro Ala Pro val Pro Leu Asn ser Gly Asn lie 
305 310 315 320 

Ser val Gin Asp Ser His Thr Ser Asp Asn Leu Ser Met Leu Ala Thr 
325 330 335 

Gly Met Pro Ser Thr Ser Tyr Gly Leu Ser Ser His Gin Glu Trp Pro 
340 345 350 

Gin His Gin Asp Ser Ala Arg Thr Glu Gin Leu Tyr ser Gin Lys Gin 
355 360 365 

Glu Thr Ser Leu ser Gly Ser Gin Tyr Asn lie Asn Phe Gin Gin Gly 
370 375 380 

Pro Ser lie Ser Leu His Ser Gly Leu His His Arg Pro Asp Lys lie 
385 390 395 400 

Ser Asp His Ser Ser val Lys Gin Glu Tyr Thr His Lys Ala Gly Ser 
405 410 415 

Ser Lys His His Gly Pro lie Ser Thr Thr Pro Gly lie lie Pro Gin 
420 425 430 

Lys Met Ser Leu Asp Lys Tyr Arg Glu Lys Arg Lys Leu Glu Thr Leu 
435 440 445 

Asp Leu Asp val Arg Asp His Tyr lie Ala Ala Gin val Glu Gin Gin 
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450 455 460 

His Lys Gin Gly Gin ser Gin Ala Ala Ser Ser Ser ser val Thr Ser 
465 470 475 480 

Pro lie Lys Met Lys lie Pro lie Ala Asn Thr Glu Lys Tyr Met Ala 
485 490 495 

Asp Lys Lys Glu Lys Ser Gly Ser Leu Lys Leu Arg lie Pro lie Pro 
500 505 510 

Pro Thr Asp Lys Ser Ala Ser Lys Glu Glu Leu Lys Met Lys He Lys 
515 520 525 

val Ser Ser Ser Glu Arg His Ser Ser ser Asp Glu Gly Ser Gly Lys 
530 535 540 

Ser Lys His Ser Ser Pro His lie Ser Arg Asp His Lys Glu Lys His 
545 550 555 560 

Lys Glu His Pro Ser Ser Arg His His Thr Ser Ser His Lys His Ser 
565 570 575 

His Ser His Ser Gly Ser ser Ser Gly Gly ser Lys His Ser Ala Asp 
580 585 590 

Gly lie Pro Pro Thr val Leu Arg Ser Pro val Gly Leu Ser ser Asp 
595 600 605 

Gly lie Ser Ser Ser Ser Ser Ser Ser Arg Lys Arg Leu His Val Asn 
610 615 620 

Asp Ala Ser His Asn His His ser Lys Met Ser Lys Ser Ser Lys Ser 
625 630 635 640 

Ser Gly Ser Ser ser Ser Ser ser Ser Ser val Lys Gin Tyr lie Ser 
645 650 655 

Ser His Asn Ser val Phe Asn His Pro Leu Pro Leu Leu Pro Cys His 
660 665 670 

lie Pro Gly Gly Leu Arg Thr ser Gin His Leu Val Lys Leu Asp Lys 
675 680 685 

Lys Pro val Glu Thr Asn Gly Pro Asp Ala Asn His Glu Tyr ser Thr 
690 695 700 

Ser Ser Gin His Met Asp Tyr Lys Asp Thr Phe Asp Met Leu Asp Ser 
705 710 715 720 

Leu Leu Ser Ala Gin Gly Met Asn Met 
725 

(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2360 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 
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GGAAGTGCCT 


GCAACCTTCG 


CCGCTGCCTT 


CTGGTTGAAG 


CACTATGGAG 


GGAGAGAGGA 


60 


AGAACAACAA 


CAAACGGTGG 


TATTTCACTC 


GAGAACAGCT 


GGAAAATAGC 


CCATCCCGTC 


120 


GTTTTGGCGT 


GGACCCAGAT 


AAAGAACTTT 


CTTATCGCCA 


GCAGGCGGCC 


AATCTGCTTC 


180 


AGGACATGGG 


GCAGCGTCTT 


AACGTCTCAC 


AATTGACTAT 


CAACACTGCT 


ATAGTATACA 


240 


TGCATCGATT 


CTACATGATT 


CAGTCCTTCA 


CACGGTTCCC 


TGGAAATTCT 


GTGGCTCCAG 


300 


CAGCCTTGTT 


TCTAGCAGCT 


AAAGTGGAGG 


AGCAGCCCAA 


AAAATTGGAA 


CATGTCATCA 


360 


AGGTAGCACA 


TACTTGTCTC 


CATCCTCAGG 


AATCCCTTCC 


TGATACTAGA 


AGTGAGGCTT 


420 


ATTTGCAACA 


AGTTCAAGAT 


CTGGTCATTT 


TAGAAAGCAT 


AAI 1 1 IGCAG 


ACI 1 IAGGCT 


480 


TTGAACTAAC 


AATTGATCAC 


CCACATACTC 


ATGTAGTAAA 


GTGCACTCAA 


CTTGTTCGAG 


540 


CAAGCAAGGA 


CTTAGCACAG 


ACTTCTTACT 


TCATGGCAAC 


CAACAGCCTG 


CATTTGACCA 


600 


CATTTAGCCT 


GCAGTACACA 


CCTCCTGTGG 


TGGCCTGTGT 


CTGCATTCAC 


CTGGCTTGCA 


660 


AGTGGTCCAA 


TTGGGAGATC 


CCAGTCTCAA 


CTGACGGGAA 


GCACTGGTGG 


GAGTATGTTG 


720 


ACGCCACTGT 


GACCTTGGAA 


CI 1 1 IAGATG 


AACTGACACA 


TGAGTTTCTA 


CAGAI II IGG 


780 


AGAAAACTCC 


CAACAGGCTC 


AAACGCATTT 


GGAATTGGAG 


GGCATGCGAG 


GCTGCCAAGA 


840 


AAACAAAAGC 


AGATGACCGA 


GGAACAGATG 


AAAAGACTTC 


AGAGCAGACA 


ATCCTCAATA 


900 


TGATTTCCCA 


GAGCTCTTCA 


GACACAACCA 


TTGCAGGTTT 


AATGAGCATG 


TCAACTTCTA 


960 


CCACAAGTGC 


AGTGCCTTCC 


CTGCCAGTCT 


CCGAAGAGTC 


ATCCAGCAAC 


TTAACCAGTG 


1020 


TGGAGATGTT 


GCCGGGCAAG 


CGTTGGCTGT 


CCTCCCAACC 


1 ICI 1 ICAAA 


CTAGAACCTA 


1080 


CTCAGGGTCA 


TCGGACTAGT 


GAGAATTTAG 


CACTTACAGG 


AGTTGATCAT 


TCCTTACCAC 


1140 


AGGATGGTTC 


AAATGCATTT 


ATTTCCCAGA 


AGCAGAATAG 


TAAGAGTGTG 


CCATCAGCTA 


1200 


AAGTGTCACT 


GAAAGAATAC 


CGCGCGAAGC 


ATGCAGAAGA 


ATTGGCTGCC 


CAGAAGAGGC 


1260 


AACTGGAGAA 


CATGGAAGCC 


AATGTGAAGT 


CACAATATGC 


ATATGCTGCC 


CAGAATCTCC 


1320 


TTTCTCATCA 


TGATAGCCAT 


TCTTCAGTCA 


TTCTAAAAAT 


GCCCATAGAG 


GGTTCAGAAA 


1380 


ACCCCGAGCG 


GCCTTTTCTG 


GAAAAGGCTG 


ACAAAACAGC 


TCTCAAAATG 


AGAATCCCAG 


1440 


TGGCAGGTGG 


AGATAAAGCT 


GCGTCTTCAA 


AACCAGAGGA 


GATAAAAATG 


CGCATAAAAG 


1500 


TCCATGCTGC 


AGCTGATAAG 


CACAATTCTG 


TAGAGGACAG 


TGTTACAAAG 


AGCCGAGAGC 


1560 


ACAAAGAAGA 


GCGCAAGACT 


CACCCATCTA 


ATCATCATCA 


TCATCATAAT 


CACCACTCAC 


1620 


ACAAGCACTC 


TCATTCCCAA 


CTTCCAGTTG 


GTACTGGGAA 


CAAACGTCCT 


GGTGATCCAA 


1680 


AACATAGTAG 


CCAGACAAGC 


AACTTAGCAC 


ATAAAACCTA 


TAGCTTGTCT 


AG 1 ICI 1 1 1 1 


1740 


CCTCTTCCAG 


TTCTACTCGT 


AAAAGGGGAC 


CCTCTGAAGA 


GACTGGAGGG 


GCTGTGTTTG 


1800 


ATCATCCAGC 


CAAGATTGCC 


AAGAGTACTA 


AATCCTCTTC 


CCTAAATTTC 


TCCTTCCCTT 


1860 
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CACTTCCTAC AATGGGTCAG ATGCCTGGGC ATAGCTCAGA CACAAGTGGC CTTTCCTTTT 1920 

CACAGCCCAG CTGTAAAACT CGTGTCCCTC ATTCGAAACT GGATAAAGGG CCCACTGGGG 1980 

CCAATGGTCA CAACACGACC CAGACAATAG ACTATCAAGA CACTGTGAAT ATGCTTCACT 2040 

CCCTGCTCAG TGCCCAGGGT GTTCAGCCCA CTCAGCCCAC TGCATTTGAA TTTGTTCGTC 2100 

CTTATAGTGA CTATCTGAAT CCTCGGTCTG GTGGAATCTC CTCGAGATCT GGCAATACAG 2160 

ACAAACCCCG GCCACCACCT CTGCCATCAG AACCTCCTCC ACCACTTCCA CCCCTTCCTA 2220 

AGTAAAAAAA GAAAAAGAAG AGGAGAAAAA AACTTCTTTA AAAAAACACA TAAI I I I I CT 2280 

[MINING GGGAAAAAAA AAI I I I I I I I AAAAI I I I I I CCCCAAGGGA CGGGGGAAAA 2340 

TTTTA I I I I I AAAAI I I I I I 2360 

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2181 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 



ATGGAGGGAG 


AGAGGAAGAA 


CAACAACAAA 


CGGTGGTATT 


TCACTCGAGA 


ACAGCTGGAA 


60 


AATAGCCCAT 


CCCGTCG 1 1 1 


TGGCGTGGAC 


CCAGATAAAG 


AACTTTCTTA 


TCGCCAGCAG 


120 


GCGGCCAATC 


TGCTTCAGGA 


CATGGGGCAG 


CGTCTTAACG 


TCTCACAATT 


GACTATCAAC 


180 


ACTGCTATAG 


TATACATGCA 


TCGATTCTAC 


ATGATTCAGT 


CCTTCACACG 


GTTCCCTGGA 


240 


AATTCTGTGG 


CTCCAGCAGC 


CTTGTTTCTA 


GCAGCTAAAG 


TGGAGGAGCA 


GCCCAAAAAA 


300 


TTGGAACATG 


TCATCAAGGT 


AGCACATACT 


TGTCTCCATC 


CTCAGGAATC 


CCTTCCTGAT 


360 


ACTAGAAGTG 


AGGCTTATTT 


GCAACAAGTT 


CAAGATCTGG 


TCAI 1 1 IAGA 


AAGCATAATT 


420 


TTGCAGACTT 


TAGGU 1 IGA 


ACTAACAATT 


GATCACCCAC 


ATACTCATGT 


AGTAAAGTGC 


480 


ACTCAACTTG 


TTCGAGCAAG 


CAAGGACTTA 


GCACAGACTT 


CTTACTTCAT 


GGCAACCAAC 


540 


AGCCTGCATT 


TGACCACATT 


TAGCCTGCAG 


TACACACCTC 


CTGTGGTGGC 


CTGTGTCTGC 


600 


ATTCACCTGG 


CTTGCAAGTG 


GTCCAATTGG 


GAGATCCCAG 


TCTCAACTGA 


CGGGAAGCAC 


660 


TGGTGGGAGT 


ATGTTGACGC 


CACTGTGACC 


TTGGAAC 1 1 1 


TAGATGAACT 


GACACATGAG 


720 


1 1 IUACAGA 


1 i 1 IGGAGAA 


AACTCCCAAC 


AGGCTCAAAC 


GCATTTGGAA 


TTGGAGGGCA 


780 


TGCGAGGCTG 


CCAAGAAAAC 


AAAAGCAGAT 


GACCGAGGAA 


CAGATGAAAA 


GACTTCAGAG 


840 


CAGACAATCC 


TCAATATGAT 


TTCCCAGAGC 


TCTTCAGACA 


CAACCATTGC 


AGGTTTAATG 


900 


AGCATGTCAA 


CTTCTACCAC 


AAGTGCAGTG 


CCTTCCCTGC 


CAGTCTCCGA 


AGAGTCATCC 


960 


AGCAACTTAA 


CCAGTGTGGA 


GATGTTGCCG 


GGCAAGCGTT 


GGCTGTCCTC 


CCAACCTTCT 


1020 
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TTCAAACTAG 


AACCTACTCA 


GGGTCATCGG 


ACTAGTGAGA 


ATTTAGCACT 


TACAGGAGTT 


1080 


GATCATTCCT 


TACCACAGGA 


TGGTTCAAAT 


GCATTTATTT 


CCCAGAAGCA 


GAATAGTAAG 


1140 


AGTGTGCCAT 


CAGCTAAAGT 


GTCACTGAAA 


GAATACCGCG 


CGAAGCATGC 


AGAAGAATTG 


1200 


GCTGCCCAGA 


AGAGGCAACT 


GGAGAACATG 


GAAGCCAATG 


TGAAGTCACA 


ATATGCATAT 


1260 


GCTGCCCAGA 


ATCTCC IMC 


TCATCATGAT 


AGCCATTCTT 


CAGTCATTCT 


AAAAATGCCC 


1320 


ATAGAGGGTT 


CAGAAAACCC 


CGAGCGGCCT 


1 1 ICIGGAAA 


AGGCTGACAA 


AACAGCTCTC 


1380 


AAAATGAGAA 


TCCCAGTGGC 


AGGTGGAGAT 


AAAGCTGCGT 


CTTCAAAACC 


AGAGGAGATA 


1440 


AAAATGCGCA 


TAAAAGTCCA 


TGCTGCAGCT 


GATAAGCACA 


ATTCTGTAGA 


GGACAGTGTT 


1500 


ACAAAGAGCC 


GAGAGCACAA 


AGAAGAGCGC 


AAGACTCACC 


CATCTAATCA 


TCATCATCAT 


1560 


CATAATCACC 


ACTCACACAA 


GCACTCTCAT 


TCCCAACTTC 


CAGTTGGTAC 


TGGGAACAAA 


1620 


CGTCCTGGTG 


ATCCAAAACA 


TAGTAGCCAG 


ACAAGCAACT 


TAGCACATAA 


AACCTATAGC 


1680 


TTGTCTAGTT 


CI 1 1 1 ICCTC 


TTCCAGTTCT 


ACTCGTAAAA 


GGGGACCCTC 


TGAAGAGACT 


1740 


GGAGGGGCTG 


TGI 1 1 GATCA 


TCCAGCCAAG 


ATTGCCAAGA 


GTACTAAATC 


CTCTTCCCTA 


1800 


AAI 1 ICICCT 


TCCCTTCACT 


TCCTACAATG 


GGTCAGATGC 


CTGGGCATAG 


CTCAGACACA 


1860 


AGTGGCCTTT 


CCTTTTCACA 


GCCCAGCTGT 


AAAACTCGTG 


TCCCTCATTC 


GAAACTGGAT 


1920 


AAAGGGCCCA 


CTGGGGCCAA 


TGGTCACAAC 


ACGACCCAGA 


CAATAGACTA 


TCAAGACACT 


1980 


GTGAATATGC 


TTCACTCCCT 


GCTCAGTGCC 


CAGGGTGTTC 


AGCCCACTCA 


GCCCACTGCA 


2040 


TTTGAATTTG 


TTCGTCCTTA 


TAGTGACTAT 


CTGAATCCTC 


GGTCTGGTGG 


AATCTCCTCG 


2100 


AGATCTGGCA 


ATACAGACAA 


ACCCCGGCCA 


CCACCTCTGC 


CATCAGAACC 


TCCTCCACCA 


2160 


CTTCCACCCC 


TTCCTAAGTA 


A 








2181 



(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 726 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 

Met Glu Gly Glu Arg Lys Asn Asn Asn Lys Arg Trp Tyr Phe Thr Arg 
15 10 15 

Glu Gin Leu Glu Asn Ser Pro Ser Arg Arg Phe Gly vail Asp Pro Asp 
20 25 30 

Lys Glu Leu Ser Tyr Arg Gin Gin Ala Ala Asn Leu Leu Gin Asp Met 
35 40 45 

Gly Gin Arg Leu Asn Val Ser Gin Leu Thr lie Asn Thr Ala lie Val 
50 55 60 
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Tyr Met His Arg Phe Tyr Met lie Gin Ser Phe Thr Arg Phe Pro Gly 
65 70 75 80 

Asn Ser val Ala Pro Ala Ala Leu Phe Leu Ala Ala Lys Val Glu Glu 
85 90 95 

Gin Pro Lys Lys Leu Glu His val lie Lys Val Ala His Thr Cys Leu 
100 105 110 

His Pro Gin Glu Ser Leu Pro Asp Thr Arg Ser Glu Ala Tyr Leu Gin 
115 120 125 

Gin val Gin Asp Leu val lie Leu Glu Ser lie lie Leu Gin Thr Leu 
130 135 140 

Gly Phe Glu Leu Thr lie Asp His Pro His Thr His val val Lys Cys 
145 150 155 160 

Thr Gin Leu val Arg Ala Ser Lys Asp Leu Ala Gin Thr Ser Tyr Phe 
165 170 175 

Met Ala Thr Asn ser Leu His Leu Thr Thr Phe Ser Leu Gin Tyr Thr 
180 185 190 

Pro Pro val val Ala Cys val Cys lie His Leu Ala cys Lys Trp ser 
195 200 205 

Asn Trp Glu lie Pro val Ser Thr Asp Gly Lys His Trp Trp Glu Tyr 
210 215 220 

Val Asp Ala Thr val Thr Leu Glu Leu Leu Asp Glu Leu Thr His Glu 
225 230 235 240 

Phe Leu Gin lie Leu Glu Lys Thr Pro Asn Arg Leu Lys Arg lie Trp 
245 250 255 

Asn Trp Arg Ala Cys Glu Ala Ala Lys Lys Thr Lys Ala Asp Asp Arg 
260 265 270 

Gly Thr Asp Glu Lys Thr ser Glu Gin Thr lie Leu Asn Met lie Ser 
275 280 285 

Gin Ser ser ser Asp Thr Thr lie Ala Gly Leu Met Ser Met Ser Thr 
290 295 300 

Ser Thr Thr Ser Ala val Pro Ser Leu Pro val Ser Glu Glu Ser ser 
305 310 315 320 

Ser Asn Leu Thr ser val Glu Met Leu Pro Gly Lys Arg Trp Leu Ser 
325 330 335 

Ser Gin Pro Ser Phe Lys Leu Glu Pro Thr Gin Gly His Arg Thr Ser 
340 345 350 

Glu Asn Leu Ala Leu Thr Gly val Asp His Ser Leu Pro Gin Asp Gly 
355 360 365 

Ser Asn Ala Phe lie Ser Gin Lys Gin Asn Ser Lys Ser Val Pro Ser 
370 375 380 

Ala Lys val ser Leu Lys Glu Tyr Arg Ala Lys His Ala Glu Glu Leu 
385 390 395 400 
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Ala Ala Gin Lys Arg Gin Leu Glu Asn Met Glu Ala Asn val Lys Ser 
405 410 415 

Gin Tyr Ala Tyr Ala Ala Gin Asn Leu Leu Ser His His Asp Ser His 
420 425 430 

Ser Ser val lie Leu Lys Met Pro lie Glu Gly Ser Glu Asn Pro Glu 
435 440 445 

Arg Pro Phe Leu Glu Lys Ala Asp Lys Thr Ala Leu Lys Met Arg lie 
450 455 460 

Pro val Ala Gly Gly Asp Lys Ala Ala Ser Ser Lys Pro Glu Glu lie 
465 470 475 480 

Lys Met Arg lie Lys val His Ala Ala Ala Asp Lys His Asn Ser val 
485 490 495 

Glu Asp Ser val Thr Lys Ser Arg Glu His Lys Glu' Glu Arg Lys Thr 
. 500 505 510 

His Pro Ser Asn His His His His His Asn His His Ser His Lys His 
515 520 525 

Ser His Ser Gin Leu Pro val Gly Thr Gly Asn Lys Arg Pro Gly Asp 
530 535 540 

Pro Lys His Ser Ser Gin Thr Ser Asn Leu Ala His Lys Thr Tyr ser 
545 550 555 . 560 

Leu Ser Ser Ser Phe Ser Ser Ser Ser ser Thr Arg Lys Arg Gly Pro 
565 570 575 

ser Glu Glu Thr Gly Gly Ala val Phe Asp His Pro Ala Lys lie Ala 
580 585 590 

Lys Ser Thr Lys Ser Ser Ser Leu Asn Phe Ser Phe Pro Ser Leu Pro 
595 600 605 

Thr Met Gly Gin Met Pro Gly His Ser Ser Asp Thr Ser Gly Leu Ser 
610 615 620 

Phe Ser Gin Pro Ser cys Lys Thr Arg val Pro His Ser Lys Leu Asp 
625 630 635 640 

Lys Gly Pro Thr Gly Ala Asn Gly His Asn Thr Thr Gin Thr lie Asp 
645 650 655 

Tyr Gin Asp Thr val Asn Met Leu His Ser Leu Leu ser Ala Gin Gly 
660 665 670 

val Gin Pro Thr Gin Pro Thr Ala Phe Glu Phe val Arg Pro Tyr Ser 
675 680 685 

Asp Tyr Leu Asn Pro Arg ser Gly Gly lie Ser ser Arg ser Gly Asn 
690 695 700 

Thr Asp Lys Pro Arg Pro Pro Pro Leu Pro Ser Glu Pro Pro Pro Pro 
705 710 715 720 

Leu Pro Pro Leu Pro Lys 
725 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 
TTCCCACCAA TGCTTTCC 



(2) INFORMATION FOR SEQ ID NO: 52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 
CCATCAGTTG ATACAGGGAT CT 



(2) INFORMATION FOR SEQ ID NO: 53: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 
GGAATTCAGA AGGTTGTAAG ATGC 



(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 
ACACACAGAT GTGGTGAAAT GTACCCA 



(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55: 
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GCATCTTACA ACCTTCTG 



(2) INFORMATION FOR SEQ ID NO: 56: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 
GGAATTCATG GAAAGCATTG GTGGGAAT 



(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
(c) strandedness: single 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

CCTCCACTAC TGGTTTGCCT GG 



(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 
GGACTAGTAT AAATATGGCG TCGGGCCGTG 



(2) INFORMATION FOR SEQ ID NO: 59: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 
GGAGATCTTA CATGTTCATT CCTTGGG 



(2) INFORMATION FOR SEQ ID NO: 60: 
(i) SEQUENCE CHARACTERISTICS: 

(A) length: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

GGAGACAAGT ATGTGCTACC TTGATGACA 



(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

Cxi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
GGAATTCGGG CTGCTCCTCC ACTTTAG 



(2) INFORMATION FOR SEQ ID NO: 62: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
GGAATTCGCT GCTGGAGCCA CAGAA 



(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GTGTCACTGA AAGAATACCG 



(2) INFORMATION FOR SEQ ID NO: 64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) topology: linear 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GGAATTCAGG TGGAGATAAA GCTGC 



(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
GCTCTAGATA AATATGGAGG GAGAGAGGAA 30 

(2) INFORMATION FOR SEQ ID NO: 66: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 
GGAATTCTTA CTTAGGAAGG GGTGGAAGTG 30 

(2) INFORMATION FOR SEQ ID NO: 67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 44 base pairs 

(B) type: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 
GGAATTCTTA CTTAGGAAGG GGTGGAAGTG GTGGAGGAGG TTAC 44 

(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 

Ala Cys Ser Tyr Ser Pro Thr ser Pro Ser Tyr Ser Pro Thr Ser Pro 
15 10 15 

Ser Tyr Ser Pro Thr Ser Pro Ser Lys Lys 
20 25 
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